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TITLE : LAFORA'S DISEASE GENE 
FIELD OF THE INVENTION 

The invention relates to a novel gene, EPM2B, that is involved in 
5 Lafora's disease; the protein, malin, encoded by the gene; and methods of 
diagnosing and treating Lafora's disease. 
BACKGROUND OF THE INVENTION 

Lafora's disease (LD, OMIM 254780) is the most common and severe 
form of adolescent-onset progressive epilepsy. Increasing seizures are 

10 paralleled with an insidious cognitive decline towards dementia, and death 
usually within 10 years of onset (1 ,2). At the cellular level, LD is characterized 
by an endoplasmic reticulum (ER)-associated accumulation (3) of starch-like 
glucose polymers (4) called polyglucosans (or Lafora bodies). Inheritance is 
autosomal recessive with genetic heterogeneity but the clinical presentation is 

15 homogeneous (5). The inventors previously discovered that mutations in the 
EPM2A gene on chromosome 6q24 encoding a dual-specificity phosphatase 
(named Laforin) with a carbohydrate binding domain, cause LD (6,7). 

There is a need in the art to identify other genes involved in Lafora's 
disease to assist in the diagnosis and treatment of Lafora's disease. 

20 SUMMARY OF THE INVENTION 

The present inventors positionally cloned a novel gene, EPM2B, on 
chromosome 6p22.3. It encodes a protein with a putative RING-finger domain 
and 6 NHL-motifs, which are features of complexes designed for ubiqurtin- 
mediated regulation of specific substrates (8,9) and protein-protein 

25 interactions (10-13), respectively. Seventeen distinct DNA sequence 
variations in EPM2B predicted to cause deleterious effects on the protein 
product, named malin, were found to co-segregate with LD in 26 families. 
Both laforin and malin localize to the ER suggesting they operate in a related 
pathway protecting against neuronal polyglucosan accumulation and epilepsy. 

30 The inventors have also isolated and sequenced the canine version of 
EPM2B. 

Accordingly, the present invention provides an isolated nucleic acid 
molecule that is associated with Lafora's disease and having the sequence 
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shown in SEQ ID NO:1 (Figure 6A). The present invention also provides an 
isolated nucleic acid molecule having a sequence shown in SEQ ID NO:3 
(Figure 7A). 

Preferably, the purified and isolated nucleic acid molecule comprises: 
5 (a) a nucleic acid sequence as shown in SEQ ID NO:1 (Figure 6A) 

and SEQ ID NO:3 (Figure 7A), wherein T can also be U; 

(b) a nucleic acid sequence complementary to (a); 

(c) a nucleic acid sequence that has substantial sequence 
homology to a nucleic acid sequence of (a) or (b); 

10 (d) a nucleic acid sequence that is an analog of a nucleic acid 

sequence of (a), (b) or (c); or 

(e) a nucleic acid sequence that hybridizes to a nucleic acid 
sequence of (a), (b), (c) or (d) under stringent hybridization conditions. 

The present invention also includes an isolated protein containing a 

15 zinc finger of the RING type and 6 NHL-repeat domains which is associated 
with Lafora's disease. In a preferred embodiment of the invention, the protein 
has the amino acid sequence as shown in SEQ ID NO:2 (Figure 6B). In 
another embodiment, the protein has the amino acid sequence shown in SEQ 
ID NO:4 (Figure 7B). 

20 As shown in Table 1 , the inventors have found 17 different mutations in 

the EPM2B gene that are associated with Lafora's disease. Accordingly, the 
present invention provides a method of detecting Lafora's disease comprising 
detecting a mutation in the EPM2B gene in a sample from a mammal. In a 
preferred embodiment, the mutation is one listed in Table 1. 

25 Other features and advantages of the present invention will become 

apparent from the following detailed description. It should be understood, 
however, that the detailed description and the specific examples while 
indicating preferred embodiments of the invention are given by way of 
illustration only, since various changes and modifications within the spirit and 

30 scope of the invention will become apparent to those skilled in the art from 
this detailed description. 
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BR1EF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in 

which: 

Figure 1. The EPM2B region on 6p22.3. The previous 2.2 Mb critical 

5 interval was delimited by microsatellite markers D6S1567 and BV012568 (15). 
The BV012568 boundary was based on loss of homozygosity in LD 
individuals from an F-C family (LD6; see Fig. 2). The D6S1567 telomeric 
boundary was defined by a recombination occurring between BV012563 and 
D6S1567 in an unaffected sibling of a second consanguineous F-C family (15) 

10 (LD27). A break in the chain of homozygosity of markers in the LD38324 
family allowed the centromeric boundary to be further re-defined to D6S1688 
(Figure 2) The region contains 7 previously annotated genes and the newly 
discovered EPM2B (Figure 6A), which comprised of a single 1188bp coding 
exon. Representative human and mouse cDNA sequences are shown, as are 

15 the putative ATG start and AATAAA polyadenylation signals. The ATG start 
follows an in-frame stop (at position -60) and the corresponding AUG is 
present at the beginning of the predicted ORF. The nucleotide sequence 
surrounding the start (CGCGCCAUGG) has the proposed features of the 
consensus sequence (GCCA/GCCAUGG) of an eukaryotic translation 

20 initiation site (29). EPM2B is predicted to encode a 42.3 kDa (395 aa) protein 
(malin) containing detectable zinc-binding RING-finger and 6 NHL-repeat 
domains (FIGURE 6B). The RING and NHL acronyms arise from descriptions 
of the first proteins identified to carry them, namely, the Really interesting 
New Gene 1 in Homo sapiens (30) and the NCL-1 (1 1)/HT2A (10)/LIN-41 (13) 

25 genes, respectively. It should be noted that RING- and/or NHL- domains 
occur in a variety of proteins which can have one or both of cytoplasmic or 
nuclear localization (8,9,12,18). Malin is the only protein so far described 
having RING and NHL motifs only (there are other proteins with this 
combination but they also have other associated motifs such as RING-B-Box- 

30 coiled-coil domains). The site of a common C332T (P1 1 1 L) polymorphism is 
shown by an askerisk (*) (see Table 1). Malin shares 79%. 80%, and 85% 
homology with the predicted rat (419 aa), mouse (401 aa) and dog (402 aa) 
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proteins, respectively. The variable amino acids were primarily located in the 
carboxy- and amino- ends of the protein and not in the RING finger or NHL 
domains. The microsatellite markers beginning with BV- were generated in 
this study. 

5 Figure 2. Refinement of the EPM2B critical interval by haplotype 

analysis in LD families and mutations in the EPM2B gene, a, The centromeric 
boundary was narrowed to D6S1688 refining the critical region to 840 kb 
based on the loss of homozygosity in both probands in family LD38324. b, 
Sequence analysis of EPM2B identifies a homozygous 76T>A change in 
10 family LD6 (as well as in the LD7, LD27, and LD28 F-C families, Table 1). 
Affected individuals in family LD38324 were found to be homozygous for a 
dinucleotide deletion (1 048-1 049delGA) leading to frame-shift mutation in the 
fifth NHL domain. 

Figure 3. RNA hybridization expression analysis of EPM2B in human 

15 tissues, a, A multiple tissue blot (Clontech) was hybridized with a 557bp 
fragment of the coding region of EPM2B. Two transcripts 2.4 kb and 1.5 kb in 
size were identified in all tissues, b, The same sized transcripts were found in 
tissues from all regions of the brain tested. 

Figure 4. Electron micrograph of brain biopsy material from patient 

20 LD32817 {EPM2B mutation 98T>C). A, axon (note the numerous normal 
neurotransmitter vesicles); S, synapse; LB, Lafora body (large rounded 
structure) composed of a dense accumulation of polyglucosan filaments (PG) 
completely occupying the dendrite. Bar equals 500nm. 

Figure 5. Cellular localization of the malin and laforin proteins, a, Myc- 

25 tagged malin (construct pcDNA3mycEPW2S) forms a distinct reticular pattern 
around the nucleus, as well as within the nucleus. Co-staining with antibody 
against the GRP94 endoplasmic reticulum (ER)-specific marker reveals co- 
localization with malin. b, Co-localization of the cytoplasmic isoform of laforin 
(construct pcDNA3mycEPM2i4 (24)) with the ER-specific marker GRP94. 

30 Figures 6A and B (SEQ ID NOS:1 and 2) provides the human nucleic 

acid and amino acid sequence of EPM2B. 



Figures 7A and B (SEQ ID N0S:3 and 4) provides the canine nucleic 
acid and amino acid sequence for EPM2B. 
DETAILED DESCRIPTION OF THE INVENTION 

Previous studies by the inventors suggested that -70% of LD patients 
5 carry recessive mutations in the EPM2A gene on chromosome 6q24 (6,7,14). 
To identify the causative gene(s) in the remaining patients, the inventors 
initially performed linkage and homozygosity mapping on a subset of the non- 
EPM2A LD families (LD6, LD7, LD27, LD28) originating from a French- 
Canadian (F-C) isolate (15) (Table 1). This approach led to the localization of 

10 a second LD locus (EPM2B) to a 2.2 Mb region on chromosome 6p22.3 
(Figure 1). All affected individuals from these F-C families were found to be 
homozygous for a rare haplotype across the entire critical interval (Figure 2a). 
To further refine the locus, five additional LD families having multiply affected 
siblings were examined using every microsatellite marker that could be 

15 developed from the DNA sequence encompassing the critical region. In four 
families, all affected individuals were homozygous for all markers across the 
critical interval. In one family (LD38324), however, the chain of homozygosity 
in two affected individuals extended only partially into the critical region 
allowing reduction of the EPM2B locus to 840 kb between D6S1688 and 

20 D6S1567 (Figures 1 and 2a). 

I. NUCLEIC ACID MOLECULES OF THE INVENTION 

As hereinbefore mentioned, the present invention relates to isolated 
nucleic acid molecules that are involved in Lafora's disease. The term 
"isolated" refers to a nucleic acid substantially free of cellular material or 

25 culture medium when produced by recombinant DNA techniques, or chemical 
precursors, or other chemicals when chemically synthesized. The term 
"nucleic acid" is intended to include DNA and RNA and can be either double 
stranded or single stranded. 

Broadly stated, the present invention provides an isolated nucleic acid 

30 molecule encoding a protein containing a zinc finger of the RING type in the 
N-terminal portion and 6 NHL-repeat domains in the C-terminal portion which 
is associated with Lafora's disease. The isolated nucleic acid molecule is 



preferably the EPM2B gene associated with Lafora's disease. In an 
embodiment of the invention, the isolated nucleic acid molecule has a 
sequence as shown in SEQ ID NO:1 (Figure 6A) and SEQ ID NO:3 (Figure 
7A). 

5 The nucleic acid sequences shown in SEQ ID NOS:1 and 3 (or Figures 

6A and 7A t respectively) as well as the mutated sequences specified in Table 
1 can be collectively referred to herein as "the nucleic acid molecules of the 
invention". The amino acid sequences shown in SEQ ID NOS:2 and 4 (or 
Figures 6B and 7B, respectively) as well as the mutated sequences specified 
10 in Table 1 can be collectively referred to herein as the "proteins of the 
invention n . 

Preferably, the purified and isolated nucleic acid molecule comprises 
(a) a nucleic acid sequence as shown in SEQ ID NO:1 (Figure 6A) 
and SEQ ID NO:3 (Figure 7A), wherein T can also be U; 
15 (b) a nucleic acid sequence complementary to (a); 

(c) a nucleic acid sequence that has substantial sequence 
homology to a nucleic acid sequence of (a) or (b); 

(d) a nucleic acid sequence that is an analog of a nucleic acid 
sequence of (a), (b) or (c); or 

20 (e) a nucleic acid sequence that hybridizes to a nucleic acid 

sequence of (a), (b), (c) or (d) under stringent hybridization conditions. 

The term "sequence that has substantial sequence homology" means 
those nucleic acid sequences which have slight or inconsequential sequence 
variations from the sequences in (a) or (b), i.e., the sequences function in 

25 substantially the same manner and can be used to detect, study or treat 
Lafora's disease. The variations may be attributable to local mutations or 
structural modifications. Nucleic acid sequences having substantial homology 
include nucleic acid sequences having at least 65%, more preferably at least 
85%, and most preferably 90-95% identity with the nucleic acid sequences as 

30 shown in SEQ ID NO:1 or SEQ ID NO:3. 

The term "sequence that hybridizes" means a nucleic acid sequence 
that can hybridize to a sequence of (a), (b), (c) or (d) under stringent 
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hybridization conditions. Appropriate "stringent hybridization conditions" 
which promote DNA hybridization are known to those skilled in the art, or may 
be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. 
(1989), 6.3.1-6.3.6. For example, the following may be employed: 6.0 x 
5 sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 
2.0 x SSC at SOX; 0.2 x SSC at SOX to 65X; or 2.0 x SSC at 44°C to SOX. 
The stringency may be selected based on the conditions used in the wash 
step. For example, the salt concentration in the wash step can be selected 
from a high stringency of about 0.2 x SSC at SOX. In addition, the 
10 temperature in the wash step can be at high stringency conditions, at about 
65X. 

The term "a nucleic acid sequence which is an analog" means a 
nucleic acid sequence which has been modified as compared to the sequence 
of (a), (b) or (c) wherein the modification does not alter the utility of the 

15 sequence as described herein. The modified sequence or analog may have 
improved properties over the sequence shown in (a), (b) or (c). One example 
of a modification to prepare an analog is to replace one of the naturally 
occurring bases (i.e. adenine, guanine, cytosine or thymidine) of the 
sequence shown in SEQ ID NO:1 or 3, with a modified base such as such as 

20 xanthine, hypoxanthine, 2-aminoadenine, 6-methyl, 2-propyl and other alkyl 
adenines, 5-halo uracil, 5-halo cytosine, 6-aza uracil, 6-aza cytosine and 6- 
aza thymine, pseudo uracil, 4-thiouracil, 8-halo adenine, 8-aminoadenine, 8- 
thiol adenine, 8-thiolalkyl adenines, 8-hydroxyl adenine and other 8- 
substituted adenines, 8-halo guanines, 8 amino guanine, 8-thiol guanine, 8- 

25 thiolalkyl guanines, 8-hydroxyl guanine and other 8-substituted guanines, 
other aza and deaza uracils, thymidines, cytosines, adenines, or guanines, 5- 
trifluoromethyl uracil and 5-trifluoro cytosine. 

Another example of a modification is to include modified phosphorous 
or oxygen heteroatoms in the phosphate backbone, short chain alkyl or 

30 cycloalkyl intersugar linkages or short chain heteroatomic or heterocyclic 
intersugar linkages in the nucleic acid molecule shown in SEQ ID NO:1 or 
SEQ ID NO:3. For example, the nucleic acid sequences may contain 
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phosphorothioates, phosphotriesters, methyl phosphonates, and 
phosphorodithioates. 

A further example of an analog of a nucleic acid molecule of the 
invention is a peptide nucleic acid (PNA) wherein the deoxyribose (or ribose) 
5 phosphate backbone in the DNA (or RNA), is replaced with a polyamide 
backbone which is similar to that found in peptides (P.E. Nielsen, et al 
Science 1991, 254, 1497). PNA analogs have been shown to be resistant to 
degradation by enzymes and to have extended lives in vivo and in vitro. 
PNAs also bind stronger to a complimentary DNA sequence due to the lack of 

10 charge repulsion between the PNA strand and the DNA strand. Other nucleic 
acid analogs may contain nucleotides containing polymer backbones, cyclic 
backbones, or acyclic backbones. For example, the nucleotides may have 
morpholino backbone structures (U.S. Pat. No. 5,034,506). The analogs may 
also contain groups such as reporter groups, a group for improving the 

15 pharmacokinetic or pharmacodynamic properties of nucleic acid sequence. 

It will be appreciated that the invention includes nucleic acid molecules 
encoding truncations of proteins of the invention, and analogs and homologs 
of proteins of the invention and truncations thereof, as described below. It will 
further be appreciated that variant forms of nucleic acid molecules of the 

20 invention which arise by alternative splicing of an mRNA corresponding to a 
cDNA of the invention are encompassed by the invention. 

Isolated and purified nucleic acid molecules having sequences which 
differ from the nucleic acid sequence of the invention due to degeneracy in 
the genetic code are also within the scope of the invention. Such nucleic 

25 acids encode functionally equivalent proteins but differ in sequence from the 
above mentioned sequences due to degeneracy in the genetic code. 

The invention also includes an isolated nucleic molecule that has a 
mutation as compared to the nucleic acid molecule shown in SEQ ID NO:1 
(Figure 6A) or SEQ ID NO:3 (Figure 7A), wherein said mutation is associated 

30 with Lafora's disease. In a preferred embodiment, the mutation is selected 
from one of the mutations shown in Table 1. 



Nucleic acid molecules from the EPM2B gene or mutated forms thereof 
can be isolated by preparing a labelled nucleic acid probe based on all or part 
of the nucleic acid sequences as shown in SEQ ID NO.1 (Figure 6A) and SEQ 
ID NO:3 (Figure 7A) or a mutated sequence shown in Table 1, and using this 
5 labelled nucleic acid probe to screen an appropriate DNA library (e.g. a cDNA 
or genomic DNA library). Nucleic acids isolated by screening of a cDNA or 
genomic DNA library can be sequenced by standard techniques. 

Nucleic acid molecules of the invention can also be isolated by 
selectively amplifying a nucleic acid using the polymerase chain reaction 

10 (PCR) methods and cDNA or genomic DNA. It is possible to design synthetic 
oligonucleotide primers from the nucleic acid molecules as shown in SEQ ID 
NO:1 (Figure 6A) and SEQ ID NO:3 (Figure 7A) or a mutated sequence 
shown in Table 1, for use in PCR. A nucleic acid can be amplified from cDNA 
or genomic DNA using these oligonucleotide primers and standard PCR 

15 amplification techniques. The nucleic acid so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. It will be 
appreciated that cDNA may be prepared from mRNA, by isolating total cellular 
mRNA by a variety of techniques, for example, by using the guanidinium- 
thiocyanate extraction procedure of Chirgwin et al., Biochemistry, 18, 5294- 

20 5299 (1979). cDNA is then synthesized from the mRNA using reverse 
transcriptase (for example, Moloney MLV reverse transcriptase available 
from Gibco/BRL, Bethesda, MD, or AMV reverse transcriptase available from 
Seikagaku America, Inc., St Petersburg, FL). 

An isolated nucleic acid molecule of the invention which is RNA can be 

25 isolated by cloning a cDNA encoding a novel protein of the invention into an 
appropriate vector which allows for transcription of the cDNA to produce an 
RNA molecule which encodes the malin protein. For example, a cDNA can 
be cloned downstream of a bacteriophage promoter, (e.g. a T7 promoter) in a 
vector, cDNA can be transcribed in vitro with T7 polymerase, and the resultant 

30 RNA can be isolated by standard techniques. 

A nucleic acid molecule of the invention may also be chemically 
synthesized using standard techniques. Various methods of chemically 



synthesizing polydeoxynucleotides are known, including solid-phase synthesis 
which, like peptide synthesis, has been fully automated in commercially 
available DNA synthesizers (See e.g., Itakura et al. U.S. Patent No. 
4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. 
5 Patent Nos. 4,401 ,796 and 4,373,071). 

The initiation codon and untranslated sequences of the nucleic acid 
molecules of the invention may be determined using currently available 
computer software designed for the purpose, such as PC/Gene 
(IntelliGenetics Inc., Calif.). Regulatory elements can be identified using 

10 conventional techniques. The function of the elements can be confirmed by 
using these elements to express a reporter gene which is operatively linked to 
the elements. These constructs may be introduced into cultured cells using 
standard procedures. In addition to identifying regulatory elements in DNA, 
such constructs may also be used to identify proteins interacting with the 

15 elements, using techniques known in the art. 

The sequence of a nucleic acid molecule of the invention may be 
inverted relative to its normal presentation for transcription to produce an 
antisense nucleic acid molecule. Preferably, an antisense sequence is 
constructed by inverting a region preceding the initiation codon or an 

20 unconsented region. In particular, the nucleic acid sequences contained in 
the nucleic acid molecules of the invention or a fragment thereof, preferably a 
nucleic acid sequence shown in SEQ ID NO:1 (Figure 6A) and SEQ ID NO: 3 
(Figure 7A) or a mutated sequence shown in Table 1 may be inverted relative 
to its normal presentation for transcription to produce antisense nucleic acid 

25 molecules. 

The antisense nucleic acid molecules of the invention or a fragment 
thereof, may be chemically synthesized using naturally occurring nucleotides 
or variously modified nucleotides designed to increase the biological stability 
of the molecules or to increase the physical stability of the duplex formed with 
30 mRNA or the native gene e.g. phosphorothioate derivatives and acridine 
substituted nucleotides. The antisense sequences may be produced 
biologically using an expression vector introduced into cells in the form of a 



recombinant plasmid, phagemid or attenuated virus in which antisense 
sequences are produced under the control of a high efficiency regulatory 
region, the activity of which may be determined by the cell type into which the 
vector is introduced. 

5 The invention also provides nucleic acids encoding fusion proteins 

comprising a novel protein of the invention and a selected protein, or a 
selectable marker protein (see below). 
II. NOVEL PROTEINS OF THE INVENTION 

The invention further includes an isolated protein encoded by the 

10 nucleic acid molecules of the invention. Within the context of the present 
invention, a protein of the invention may include various structural forms of the 
primary protein which retain biological activity. 

Broadly stated, the present invention provides an isolated protein 
containing a zinc finger of the RING type in the N-terminal half and 6 NHL- 

15 repeat domains in the C-terminal direction which is associated with Lafora's 
disease. Preferably, the zinc-binding RING-finger motif (C-X2-C-X 16 -C-Xi-H- 
X2-C-X2-C-X14-C-X2-C) is located between residues 26-71 of the malin protein 
shown in Figure 6B. The presence of a RING finger is predictive of an E3 
ubiquitin ligase function. Therefore, in a preferred embodiment, the protein 

20 has a ubiquitin ligase function. 

In a specific embodiment of the invention, the protein has the amino 
acid sequence as shown in SEQ ID NO:2 (Figure 6B). In another 
embodiment, the protein has the amino acid sequence shown in SEQ ID NO:4 
(Figure 7B). 

25 In addition to full length amino acid sequences the proteins of the 

present invention also include truncations of the protein, and analogs, and 
homologs of the protein and truncations thereof as described herein. 
Truncated proteins may comprise peptides of at least fifteen amino acid 
residues. 

30 Analogs of the protein having the amino acid sequence shown in SEQ 

ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 7B) and/or truncations thereof 
as described herein, may include, but are not limited to an amino acid 



sequence containing one or more amino acid substitutions, insertions, and/or 
deletions. Amino acid substitutions may be of a conserved or non-conserved 
nature. Conserved amino acid substitutions involve replacing one or more 
amino acids of the proteins of the invention with amino acids of similar charge, 
5 size, and/or hydrophobicity characteristics. When only conserved 
substitutions are made the resulting analog should be functionally equivalent. 
Non-conserved substitutions involve replacing one or more amino acids of the 
amino acid sequence with one or more amino acids which possess dissimilar 
charge, size, and/or hydrophobicity characteristics. 

10 One or more amino acid insertions may be introduced into the amino 

acid sequences shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 
7B). Amino acid insertions may consist of single amino acid residues or 
sequential amino acids ranging from 2 to 15 amino acids in length. For 
example, amino acid insertions may be used to destroy target sequences so 

15 that the protein is no longer active. This procedure may be used in vivo to 
inhibit the activity of a protein of the invention. 

Deletions may consist of the removal of one or more amino acids, or 
discrete portions from the amino acid sequence shown in SEQ ID NO:2 
(Figure 6B) or SEQ ID NO:4 (Figure 7B). The deleted amino acids may or 

20 may not be contiguous. The lower limit length of the resulting analog with a 
deletion mutation is about 10 amino acids, preferably 100 amino acids. 

Analogs of a protein of the invention may be prepared by introducing 
mutations in the nucleotide sequence encoding the protein. Mutations in 
nucleotide sequences constructed for expression of analogs of a protein of 

25 the invention must preserve the reading frame of the coding sequences. 
Furthermore, the mutations will preferably not create complementary regions 
that could hybridize to produce secondary mRNA structures, such as loops or 
hairpins, which could adversely affect translation of the receptor mRNA. 

Mutations may be introduced at particular loci by synthesizing 

30 oligonucleotides containing a mutant sequence, flanked by restriction sites 
enabling ligation to fragments of the native sequence. Following ligation, the 
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resulting reconstructed sequence encodes an analog having the desired 
amino acid insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site specific mutagenesis 
procedures may be employed to provide an altered gene having particular 
5 codons altered according to the substitution, deletion, or insertion required. 
Deletion or truncation of a protein of the invention may also be constructed by 
utilizing convenient restriction endonuclease sites adjacent to the desired 
deletion. Subsequent to restriction, overhangs may be filled in, and the DNA 
religated. Exemplary methods of making the alterations set forth above are 

10 disclosed by Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd 
Ed., Cold Spring Harbor Laboratory Press, 1989). 

The proteins of the invention also include homologs of the amino acid 
sequence shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 7B) 
and/or truncations thereof as described herein. Such homologs are proteins 

15 whose amino acid sequences are comprised of amino acid sequences that 
hybridize under stringent hybridization conditions (see discussion of stringent 
hybridization conditions herein) with a probe used to obtain a protein of the 
invention. Preferably, homologs of a protein of the invention will have a 
tyrosine phosphatase region which is characteristic of the protein. 

20 A homologous protein includes a protein with an amino acid sequence 

having at least 70%, preferably 80-90% identity with the amino acid sequence 
as shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 7B). 

The invention also contemplates isoforms of the proteins of the 
invention. An isoform contains the same number and kinds of amino acids as 

25 a protein of the invention, but the isoform has a different molecular structure. 
The isoforms contemplated by the present invention are those having the 
same properties as a protein of the invention as described herein. 

The invention also includes an isolated protein that has a mutation as 
compared to the amino acid sequence shown in SEQ ID NO:2 (Figure 6B) or 

30 SEQ ID NO:4 (Figure 7B), wherein said mutation is associated with Lafora's 
disease. In a preferred embodiment, the mutation is selected from one of the 
mutations shown in Table 1. 



The present invention also includes a protein of the invention 
conjugated with a selected protein, or a selectable marker protein (see below) 
to produce fusion proteins. Additionally, immunogenic portions of a protein of 
the invention are within the scope of the invention. 
5 The proteins of the invention (including truncations, analogs, mutants 

etc.) may be prepared using recombinant DNA methods. Accordingly, the 
nucleic acid molecules of the present invention having a sequence which 
encodes a protein of the invention may be incorporated in a known manner 
into an appropriate expression vector which ensures good expression of the 

10 protein. Possible expression vectors include but are not limited to cosmids, 
plasmids, or modified viruses (e.g. replication defective retroviruses, 
adenoviruses and adeno-associated viruses), so long as the vector is 
compatible with the host cell used. The expression vectors are "suitable for 
transformation of a host cell", means that the expression vectors contain a 

15 nucleic acid molecule of the invention and regulatory sequences selected on 
the basis of the host cells to be used for expression, which is operatively 
linked to the nucleic acid molecule. Operatively linked is intended to mean 
that the nucleic acid is linked to regulatory sequences in a manner which 
allows expression of the nucleic acid. 

20 The invention therefore contemplates a recombinant expression vector 

of the invention containing a nucleic acid molecule of the invention, or a 
fragment thereof, and the necessary regulatory sequences for the 
transcription and translation of the inserted protein-sequence. Suitable 
regulatory sequences may be derived from a variety of sources, including 

25 bacterial, fungal, or viral genes (For example, see the regulatory sequences 
described in Goeddel, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, CA (1990). Selection of appropriate 
regulatory sequences is dependent on the host cell chosen, and may be 
readily accomplished by one of ordinary skill in the art. Examples of such 

30 regulatory sequences include: a transcriptional promoter and enhancer or 
RNA polymerase binding sequence, a ribosomal binding sequence, including 
a translation initiation signal. Additionally, depending on the host cell chosen 
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and the vector employed, other sequences, such as an origin of replication, 
additional DNA restriction sites, enhancers, and sequences conferring 
inducibility of transcription may be incorporated into the expression vector. It 
will also be appreciated that the necessary regulatory sequences may be 
5 supplied by the native protein and/or its flanking regions. 

The invention further provides a recombinant expression vector 
comprising a DNA nucleic acid molecule of the invention cloned into the 
expression vector in an antisense orientation. That is, the DNA molecule is 
operatively linked to a regulatory sequence in a manner which allows for 

10 expression, by transcription of the DNA molecule, of an RIMA molecule which 
is antisense to a nucleotide sequence comprising the nucleotides as shown 
SEQ ID NO:1 or SEQ ID NO:3. Regulatory sequences operatively linked to 
the antisense nucleic acid can be chosen which direct the continuous 
expression of the antisense RNA molecule. 

15 The recombinant expression vectors of the invention may also contain 

a selectable marker gene which facilitates the selection of host cells 
transformed or transfected with a recombinant molecule of the invention. 
Examples of selectable marker genes are genes encoding a protein such as 
G418 and hygromycin which confer resistance to certain drugs, 0- 

20 galactosidase, chloramphenicol acetyltransferase, or firefly luciferase. 
Transcription of the selectable marker gene is monitored by changes in the 
concentration of the selectable marker protein such as p-galactosidase, 
chloramphenicol acetyltransferase, or firefly luciferase. If the selectable 
marker gene encodes a protein conferring antibiotic resistance such as 

25 neomycin resistance transformant cells can be selected with G418. Cells that 
have incorporated the selectable marker gene will survive, while the other 
cells die. This makes it possible to visualize and assay for expression of 
recombinant expression vectors of the invention and in particular to determine 
the effect of a mutation on expression and phenotype. It will be appreciated 

30 that selectable markers can be introduced on a separate vector from the 
nucleic acid of interest. 
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The recombinant expression vectors may also contain genes which 
encode a fusion moiety which provides increased expression of the 
recombinant protein; increased solubility of the recombinant protein; and aid 
in the purification of a target recombinant protein by acting as a ligand in 
5 affinity purification. For example, a proteolytic cleavage site may be added to 
the target recombinant protein to allow separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. 

Recombinant expression vectors can be introduced into host cells to 
produce a transformed host cell. The term "transformed host cell" is intended 

10 to include prokaryotic and eukaryotic cells which have been transformed or 
transfected with a recombinant expression vector of the invention. The terms 
"transformed with", "transfected with", "transformation" and "transfection" are 
intended to encompass introduction of nucleic acid (e.g. a vector) into a cell 
by one of many possible techniques known in the art. Prokaryotic cells can 

15 be transformed with nucleic acid by, for example, electroporation or calcium- 
chloride mediated transformation. Nucleic acid can be introduced into 
mammalian cells via conventional techniques such as calcium phosphate or 
calcium chloride co precipitation, DEAE-dextran-mediated transfection, 
lipofectin, electroporation or microinjection. Suitable methods for transforming 

20 and transfecting host cells can be found in Sambrook et al. (Molecular 
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory 
press (1989)), and other laboratory textbooks. 

Suitable host cells include a wide variety of prokaryotic and eukaryotic 
host cells. For example, the proteins of the invention may be expressed in 

25 bacterial cells such as E. co//, insect cells (using baculovirus), yeast cells or 
mammalian cells. Other suitable host cells can be found in Goeddel, Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA(1991). 

The proteins of the invention may also be prepared by chemical 
30 synthesis using techniques well known in the chemistry of proteins such as 
solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149- 



2154) or synthesis in homogenous solution (Houbenweyl, 1987, Methods of 

Organic Chemistry, ed. E. Wansch, Vol. 15 I and II, Thieme, Stuttgart). 

Ill APPLICATIONS 

The present invention includes all uses of the nucleic acid molecule 
5 and proteins of the invention including, but not limited to, the preparation of 

antibodies and antisense oligonucleotides, the preparation of experimental 

systems to study EPM2B and mutated forms thereof, the isolation of 

substances that modulate EPM2B expression and/or activity as well as the 

use of the EPM2B nucleic acid sequences and proteins and modulators 
10 thereof in diagnostic and therapeutic applications. Some of the uses are 

further described below. 

A. Diagnostic Applications 

As previously mentioned, the present inventors have determined that 

the gene EPM2B is mutated in people with Lafora's disease. As a result, the 
15 present invention also includes a method of detecting Lafora's disease by 

detecting a mutation in the EPM2B gene or protein. 

(i) Detecting Mutations in the Nucleic Acid Sequence 

In one embodiment, the present invention provides a method for 

detecting Lafora's disease comprising detecting a mutation in the EPM2B 
20 gene in a sample obtained from an animal, preferably a mammal, more 

preferably a human. 

Table 1 summarizes some of the mutations found in EPM2B in patients 

with Lafora's disease. To date, 17 different DNA sequence alterations have 

been found in EPM2B in 26 families including 8 deletions and 1 insertion 
25 leading to frame-shifts, 7 missense, and 1 non-sense change. Screening 

assays can be developed for each of the mutations. 

The most common mutation identified in seven families is a 

homozygous 205C -> G transition resulting in a proline to alanine change in 

the RING-finger domain. Accordingly, in one embodiment, the present 
30 invention provides a method of detecting Lafora's disease comprising 

detecting a C -> G mutation at position 205 in the EPM2B gene (SEQ ID 

NO:1). 
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Another mutation observed in the four consanguineous F-C families 
used in the original linkage study all carried a homozygous 76T>A change 
producing a cysteine-to-serine alteration in one of the 7 conserved cysteine 
residues that are critical for the zinc-binding ability of the RING-finger domain. 
5 Accordingly, in another embodiment, the present invention provides a method 
of detecting Lafora's disease comprising detecting a T A mutation at 
position 76 in the EPM2B gene (SEQ ID NO:1). 

Another mutation was observed in the LD38324 family that was critical 
in refining the EPM2B locus (Figure 2a) which was a homozygous 2-bp 
10 deletion (1 048-1 049delGA) leading to a frame-shift mutation in the fifth NHL- 
domain (Figure 2b). Accordingly, in a further embodiment, the present 
invention provides a method of detecting Lafora's disease comprising 
detecting a deletion of GA at positions 1048 and 1049 in the EPM2B gene 
(SEQ ID NO:1). 

15 One skilled in the art will appreciate that other methods, in addition to 

the ones discussed above and in the examples, can be used to detect 
mutations in the EPM2B gene. For example, in order to isolate nucleic acids 
from the Lafora's disease gene in a sample, one can prepare nucleotide 
probes from the nucleic acid sequences of the invention. In addition, the 

20 nucleic acid probes described herein can also be used. A nucleotide probe 
may be labelled with a detectable marker such as a radioactive label which 
provides for an adequate signal and has sufficient half life such as 32 P, 3 H, 14 C 
or the like. Other detectable markers which may be used include antigens 
that are recognized by a specific labelled antibody, fluorescent compounds, 

25 enzymes, antibodies specific for a labelled antigen, and chemiluminescent 
compounds. An appropriate label may be selected having regard to the rate 
of hybridization and binding of the probe to the nucleotide to be detected and 
the amount of nucleotide available for hybridization. 

Accordingly, the present invention also relates to a method of detecting 

30 the presence of a nucleic acid molecule from the EPM2B gene in a sample 
comprising contacting the sample under hybridization conditions with one or 
more of nucleotide probes which hybridize to the nucleic acid molecules and 



are labelled with a detectable marker, and determining the degree of 
hybridization between the nucleic acid molecule in the sample and the 
nucleotide probes. 

Hybridization conditions which may be used in the methods of the 

5 invention are known in the art and are described for example in Sambrook J, 
Fritch EF, Maniatis T. In: Molecular Cloning, A Laboratory Manual,1989. 
(Nolan C, Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY. The hybridization product may be assayed using techniques known in 
the art. The nucleotide probe may be labelled with a detectable marker as 

10 described herein and the hybridization product may be assayed by detecting 
the detectable marker or the detectable change produced by the detectable 
marker. 

Prior to hybridizing a sample with DNA probes, the sample can be 
treated with primers that flank the EPM2B gene in order to amplify the nucleic 

15 acid sequences in the sample. The primers used may be the ones described 
in the present application. In addition, the sequence of the EPM2B gene 
provided herein also permits the identification and isolation, or synthesis of 
new nucleotide sequences which may be used as primers to amplify a nucleic 
acid molecule of the invention, for example in the polymerase chain reaction 

20 (PCR) which is discussed in more detail below. The primers may be used to 
amplify the genomic DNA of other species. The PCR amplified sequences 
can be examined to determine the relationship between the genes of various 
species. 

The length and bases of the primers for use in the PCR are selected so 
25 that they will hybridize to different strands of the desired sequence and at 
relative positions along the sequence such that an extension product 
synthesized from one primer when it is separated from its template can serve 
as a template for extension of the other primer into a nucleic acid of defined 
length. Primers which may be used in the invention are oligonucleotides i.e. 
30 molecules containing two or more deoxyribonucleotides of the nucleic acid 
molecule of the invention which occur naturally as in a purified restriction 
endonuclease digest or are produced synthetically using techniques known in 



the art such as for example phosphotriester and phosphodiester methods 
(See Good et al Nucl. Acid Res 4:2157, 1977) or automated techniques (See 
for example, Conolly, B A. Nucleic Acids Res. 15:15(7): 3131, 1987). The 
primers are capable of acting as a point of initiation of synthesis when placed 
5 under conditions which permit the synthesis of a primer extension product 
which is complementary to the DNA sequence of the invention i.e. in the 
presence of nucleotide substrates, an agent for polymerization such as DNA 
polymerase and at suitable temperature and pH. Preferably, the primers are 
sequences that do not form secondary structures by base pairing with other 

10 copies of the primer or sequences that form a hair pin configuration. The 
primer preferably contains between about 7 and 25 nucleotides. 

The primers may be labelled with detectable markers which allow for 
detection of the amplified products. Suitable detectable markers are 
radioactive markers such as P-32, S-35, 1-125, and H-3, luminescent markers 

15 such as chemiluminescent markers, preferably luminol, and fluorescent 
markers, preferably dansyl chloride, fluorcein-5-isothiocyanate, and 4-fluor-7- 
nitrobenz-2-axa-1,3 diazole, enzyme markers such as horseradish 
peroxidase, alkaline phosphatase, p-galactosidase, acetylcholinesterase, or 
biotin. 

20 It will be appreciated that the primers may contain non-complementary 

sequences provided that a sufficient amount of the primer contains a 
sequence which is complementary to a nucleic acid molecule of the invention 
or oligonucleotide fragment thereof, which is to be amplified. Restriction site 
linkers may also be incorporated into the primers allowing for digestion of the 

25 amplified products with the appropriate restriction enzymes facilitating cloning 
and sequencing of the amplified product. 

In an embodiment of the invention a method of determining the 
presence of a nucleic acid molecule of the invention is provided comprising 
treating the sample with primers which are capable of amplifying the nucleic 

30 acid molecule or a predetermined oligonucleotide fragment thereof in a 
polymerase chain reaction to form amplified sequences, under conditions 
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which permit the formation of amplified sequences and, assaying for amplified 
sequences. 

The polymerase chain reaction refers to a process for amplifying a 
target nucleic acid sequence as generally described in Innis et al, Academic 

5 Press, 1990 in Mullis el al., U.S. Pat. No. 4,863,195 and Mullis, U.S. Patent 
No. 4,683,202 which are incorporated herein by reference. Conditions for 
amplifying a nucleic acid template are described in MA Innis and D.H. 
Gelfand, PCR Protocols, A Guide to Methods and Applications M.A. Innis, 
D.H. Gelfand, JJ. Sninsky and T.J. White eds, pp3-12 t Academic Press 1989, 

10 which is also incorporated herein by reference. 

The amplified products can be isolated and distinguished based on 
their respective sizes using techniques known in the art. For example, after 
amplification, the DNA sample can be separated on an agarose gel and 
visualized, after staining with ethidium bromide, under ultra violet (UW) light. 

15 DNA may be amplified to a desired level and a further extension reaction may 
be performed to incorporate nucleotide derivatives having detectable markers 
such as radioactive labelled or biotin labelled nucleoside triphosphates. The 
primers may also be labelled with detectable markers as discussed above. 
The detectable markers may be analyzed by restriction and electrophoretic 

20 separation or other techniques known in the art. 

The conditions which may be employed in the methods of the invention 
using PCR are those which permit hybridization and amplification reactions to 
proceed in the presence of DNA in a sample and appropriate complementary 
hybridization primers. Conditions suitable for the polymerase chain reaction 

25 are generally known in the art. For example, see M.A. Innis and D.H. Gelfand, 
PCR Protocols, A guide to Methods and Applications M.A. Innis, D.H. 
Gelfand, J.J. Sninsky and T.J. White eds, pp3-12, Academic Press 1989, 
which is incorporated herein by reference. Preferably, the PCR utilizes 
polymerase obtained from the thermophilic bacterium Thermus aquatics (Taq 

30 polymerase, GeneAmp Kit, Perkin Elmer Cetus) or other thermostable 
polymerase may be used to amplify DNA template strands. 
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It will be appreciated that other techniques such as the Ligase Chain 
Reaction (LCR) and NASBA may be used to amplify a nucleic acid molecule 
of the invention (Barney in "PCR Methods and Applications" , August 1991, 
Vol. 1(1), page 5, and European Published Application No. 0320308, 
5 published June 14, 1989, and U.S. Serial NO. 5,130,238 to Malek). 
(ii) Detecting the Malin Protein 

In another embodiment, the present invention provides a method for 
detecting Lafora's disease comprising determining if the malin protein is 
present or mutated in a sample from a mammal, preferably a human, 

10 suspected of having Lafora's disease. 

The malin protein of the present invention may be detected in a 
biological sample using antibodies that are specific for malin using various 
immunoassays that are discussed below. Antibodies that only react with 
mutated malin would be useful as diagnostic agents to detect Lafora's 

15 disease. As such, antibodies would be prepared that bind only a mutated 
region of the protein. 

Conventional methods can be used to prepare the antibodies. For 
example, by using a peptide from the malin protein of the invention, polyclonal 
antisera or monoclonal antibodies can be made using standard methods. A 

20 mammal, (e.g., a mouse, hamster, or rabbit) can be immunized with an 
immunogenic form of the peptide which elicits an antibody response in the 
mammal. Techniques for conferring immunogenicity on a peptide include 
conjugation to carriers or other techniques well known in the art. For 
example, the peptide can be administered in the presence of adjuvant. The 

25 progress of immunization can be monitored by detection of antibody titers in 
plasma or serum. Standard ELISA or other immunoassay procedures can be 
used with the immunogen as antigen to assess the levels of antibodies. 
Following immunization, antisera can be obtained and, if desired, polyclonal 
antibodies isolated from the sera. 

30 To produce monoclonal antibodies, antibody producing cells 

(lymphocytes) can be harvested from an immunized animal and fused with 
myeloma cells by standard somatic cell fusion procedures thus immortalizing 
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these cells and yielding hybridoma cells. Such techniques are well known in 
the art, (e.g., the hybridoma technique originally developed by Kohler and 
Milstein (Nature 256, 495-497 (1975)) as well as other techniques such as the 
human B-cell hybridoma technique (Kozbor et al., Immunol. Today 4, 72 
5 (1983)), the EBV hybridoma technique to produce human monoclonal 
antibodies (Cole et al. Monoclonal Antibodies in Cancer Therapy (1985) Allen 
R. Bliss, Inc., pages 77-96), and screening of combinatorial antibody libraries 
(Huse et al., Science 246, 1275 (1989)]. Hybridoma cells can be screened 
immunochemically for production of antibodies specifically reactive with the 

10 peptide and the monoclonal antibodies can be isolated. Therefore, the 
invention also contemplates hybridoma cells secreting monoclonal antibodies 
with specificity for a protein of the invention. 

The term "antibody" as used herein is intended to include fragments 
thereof which also specifically react with a protein, of the invention, or peptide 

15 thereof. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above. For 
example, F(ab') 2 fragments can be generated by treating antibody with 
pepsin. The resulting F(ab') 2 fragment can be treated to reduce disulfide 
bridges to produce Fab' fragments. 

20 Chimeric antibody derivatives, i.e., antibody molecules that combine a 

non human animal variable region and a human constant region are also 
contemplated within the scope of the invention. Chimeric antibody molecules 
can include, for example, the antigen binding domain from an antibody of a 
mouse, rat, or other species, with human constant regions. Conventional 

25 methods may be used to make chimeric antibodies containing the 
immunoglobulin variable region which recognizes a malin protein (see, for 
example, Morrison et al., Proc. Natl Acad. Sci. U.S.A. 81,6851 (1985); 
Takeda et al., Nature 314, 452 (1985), Cabilly et al., U.S. Patent No. 
4,816,567; Boss et al., U.S. Patent No. 4,816,397; Tanaguchi et al., 

30 European Patent Publication EP1 71496; European Patent Publication 
0173494, United Kingdom patent GB 2177096B). 
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Monoclonal or chimeric antibodies specifically reactive with a protein of 
the invention as described herein can be further humanized by producing 
human constant region chimeras, in which parts of the variable regions, 
particularly the conserved framework regions of the antigen-binding domain, 
5 are of human origin and only the hypervariable regions are of non-human 
origin. Such immunoglobulin molecules may be made by techniques known 
in the art, (e.g., Teng et al., Proc. Natl. Acad. Sci. U.S.A., 80, 7308-7312 
(1983); Kozbor et al., Immunology Today, 4, 7279 (1983); Olsson et al., Meth. 
Enzymol., 92, 3-16 (1982)), and PCT Publication WO92/06193 or EP 

10 0239400). Humanized antibodies can also be commercially produced 
(Scotgen Limited, 2 Holly Road, Twickenham, Middlesex, Great Britain.) 

Specific antibodies, or antibody fragments, reactive against a protein of 
the invention may also be generated by screening expression libraries 
encoding immunoglobulin genes, or portions thereof, expressed in bacteria 

15 with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions 
can be expressed in bacteria using phage expression libraries (See for 
example Ward et al., Nature 341, 544-546: (1989); Huse et al., Science 246, 
1275-1281 (1989); and McCafferty et al. Nature 348, 552-554 (1990)). 

20 Antibodies may also be prepared using DNA immunization. For 

example, an expression vector containing a nucleic acid of the invention (as 
described above) may be injected into a suitable animal such as mouse. The 
protein of the invention will therefore be expressed in vivo and antibodies will 
be induced. The antibodies can be isolated and prepared as described above 

25 for protein immunization. 

The antibodies may be labelled with a detectable marker including 
various enzymes, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, 
biotin, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; 

30 examples of suitable fluorescent materials include umbelliferone, fluorescein, 
fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, 
dansyl chloride or phycoerythrin; an example of a luminescent material 
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includes luminol; and examples of suitable radioactive material include S-35, 
Cu-64, Ga-67, Zr-89, Ru-97, Tc-99m, Rh-105. Pd-109, ln-111, 1-123, 1-125, 
1131, Re-186, Au-198, Au-199, Pb-203, At-211, Pb-212 and Bi-212. The 
antibodies may also be labelled or conjugated to one partner of a ligand 
5 binding pair. Representative examples include avidin-biotin and riboflavin- 
riboflavin binding protein. Methods for conjugating or labelling the antibodies 
discussed above with the representative labels set forth above may be readily 
accomplished using conventional techniques. 

The antibodies reactive against proteins of the invention (e.g. enzyme 

10 conjugates or labelled derivatives) may be used to detect a protein of the 
invention in various samples, for example they may be used in any known 
immunoassays which rely on the binding interaction between an antigenic 
determinant of a protein of the invention and the antibodies. Examples of 
such assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA), 

15 immunofluorescence, immuno-precipitation, latex agglutination, 
hemagglutination, and histochemical tests. Thus, the antibodies may be used 
to identify or quantify the amount of a protein of the invention in a sample in 
order to diagnose the presence of Lafora's disease. 

In a method of the invention a predetermined amount of a sample or 

20 concentrated sample is mixed with antibody or labelled antibody. The amount 
of antibody used in the process is dependent upon the labelling agent chosen. 
The resulting protein bound to antibody or labelled antibody may be isolated 
by conventional isolation techniques, for example, salting out, 
chromatography, electrophoresis, gel filtration, fractionation, absorption, 

25 polyacrylamide gel electrophoresis, agglutination, or combinations thereof. 

The sample or antibody may be insolubilized, for example, the sample 
or antibody can be reacted using known methods with a suitable carrier. 
Examples of suitable carriers are Sepharose or agarose beads. When an 
insolubilized sample or antibody is used protein bound to antibody or 

30 unreacted antibody is isolated by washing. For example, when the sample is 
blotted onto a nitrocellulose membrane, the antibody bound to a protein of the 
invention is separated from the unreacted antibody by washing with a buffer, 
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for example, phosphate buffered saline (PBS) with bovine serum albumin 
(BSA). 

When labelled antibody is used, the presence of malin can be 
determined by measuring the amount of labelled antibody bound to a protein 
5 of the invention in the sample or of the unreacted labelled antibody. The 
appropriate method of measuring the labelled material is dependent upon the 
labelling agent. 

When unlabelled antibody is used in the method of the invention, the 
presence of malin can be determined by measuring the amount of antibody 

10 bound to the protein using substances that interact specifically with the 
antibody to cause agglutination or precipitation. In particular, labelled 
antibody against an antibody specific for a protein of the invention, can be 
added to the reaction mixture. The presence of a protein of the invention can 
be determined by a suitable method from among the already described 

15 techniques depending on the type of labelling agent. The antibody against an 
antibody specific for a protein of the invention can be prepared and labelled 
by conventional procedures known in the art which have been described 
herein. The antibody against an antibody specific for a protein of the 
invention may be a species specific antMmmunoglobulin antibody or 

20 monoclonal antibody, for example, goat anti-rabbit antibody may be used to 
detect rabbit antibody specific for a protein of the invention, 
(iii) Krts 

The reagents suitable for carrying out the methods of the invention may 
be packaged into convenient kits providing the necessary materials, packaged 

25 into suitable containers. Such kits may include all the reagents required to 
detect a nucleic acid molecule or protein of the invention in a sample by 
means of the methods described herein, and optionally suitable supports 
useful in performing the methods of the invention. 

In one embodiment of the invention, the kit includes primers which are 

30 capable of amplifying a nucleic acid molecule of the invention or a 
predetermined oligonucleotide fragment thereof, all the reagents required to 
produce the amplified nucleic acid molecule or predetermined fragment 
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thereof in the polymerase chain reaction, and means for assaying the 
amplified sequences. The kit may also include restriction enzymes to digest 
the PCR products. In another embodiment of the invention the kit contains a 
nucleotide probe which hybridizes with a nucleic acid molecule of the 
5 invention, reagents required for hybridization of the nucleotide probe with the 
nucleic acid molecule, and directions for its use. In a further embodiment of 
the invention the kit includes antibodies of the invention and reagents required 
for binding of the antibody to a protein of the invention in a sample. 

The methods and kits of the present invention may be used to detect 

10 Lafora's disease. Samples which may be tested include bodily materials such 
as blood, urine, serum, tears, saliva, feces, tissues, cells and the like. In 
addition to human samples, samples may be taken from mammals such as 
non-human primates, etc. 

Before testing a sample in accordance with the methods described 

15 herein, the sample may be concentrated using techniques known in the art, 
such as centrifugation and filtration. For the hybridization and/or PCR-based 
methods described herein, nucleic acids may be extracted from cell extracts 
of the test sample using techniques known in the art. 
B. EPM2fi/Malin Modulators 

20 In addition to antibodies and antisense oligonucleotides described 

above, other substances that modulate EPM2B expression or activity may 
also be identified, as well as substances that modulate mutated forms of 
malin. 

(i) Substances that Bind Malin 
25 Substances that affect malin activity can be identified based on their 

ability to bind to malin and/or mutated malin. 

Substances which can bind with the malin of the invention may be 

identified by reacting the malin with a substance which potentially binds to 

malin, and assaying for complexes, for free substance, or for non-complexed 
30 malin, or for activation of malin. In particular, a yeast two hybrid assay system 

may be used to identify proteins which interact with malin (Fields, S. and 
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Song, O., 1989, Nature, 340:245-247). Systems of analysis which also may 
be used include ELISA. 

Accordingly, the invention provides a method of identifying substances 
which can bind with malin, comprising the steps of: 
5 (a) reacting malin and a test substance, under conditions which 

allow for formation of a complex between the malin and the test substance, 
and 

(b) assaying for complexes of malin and the test substance, for free 
substance or for non complexed malin, wherein the presence of complexes 

10 indicates that the test substance is capable of binding malin. 

The malin protein used in the assay may have the amino acid 
sequence shown in SEQ ID NO:2 (Figure 6B) or SEQ ID NO:4 (Figure 7B) or 
may be a mutated protein as shown in Table 1 or may be a fragment, analog, 
derivative, homolog or mimetic thereof as described herein. 

15 Conditions which permit the formation of substance and malin 

complexes may be selected having regard to factors such as the nature and 
amounts of the substance and the protein. 

The substance-protein complex, free substance or non-complexed 
proteins may be isolated by conventional isolation techniques, for example, 

20 salting out, chromatography, electrophoresis, gel filtration, fractionation, 
absorption, polyacrylamide gel electrophoresis, agglutination, or combinations 
thereof. To facilitate the assay of the components, antibody against malin or 
the substance, or labelled malin, or a labelled substance may be utilized. The 
antibodies, proteins, or substances may be labelled with a detectable 

25 substance as described above. 

Malin, or the substance used in the method of the invention may be 
insolubilized. For example, malin or substance may be bound to a suitable 
carrier. Examples of suitable carriers are agarose, cellulose, dextran, 
Sephadex, Sepharose, carboxymethyl cellulose polystyrene, filter paper, ion- 

30 exchange resin, plastic film, plastic tube, glass beads, polyamine-methyl vinyl- 
ether-maleic acid copolymer, amino acid copolymer, ethylene-maleic acid 
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copolymer, nylon, silk, etc. The carrier may be in the shape of, for example, a 
tube, test plate, beads, disc, sphere etc. 

The insolubilized protein or substance may be prepared by reacting the 
material with a suitable insoluble carrier using known chemical or physical 
5 methods, for example, cyanogen bromide coupling. 

The proteins or substance may also be expressed on the surface of a 
cell using the methods described herein. 

The invention also contemplates assaying for an antagonist or agonist 
of the action of malin. 
10 It will be understood that the agonists and antagonists that can be 

assayed using the methods of the invention may act on one or more of the 
binding sites on the protein or substance including agonist binding sites, 
competitive antagonist binding sites, non-competitive antagonist binding sites 
or allosteric sites. 

15 The invention also makes it possible to screen for antagonists that 

inhibit the effects of an agonist of malm. Thus, the invention may be used to 
assay for a substance that competes for the same binding site of malin. 
(ii) Peptide Mimetics 

The present invention also includes peptide mimetics of the malin and 

20 mutated malin proteins of the invention. For example, a peptide derived from 
a the mutated domain of malin will interact directly or indirectly with an 
associated molecule in such a way as to mimic the native binding of the 
mutated protein. Such peptides may include competitive inhibitors, 
enhancers, peptide mimetics, and the like. All of these peptides as well as 

25 molecules substantially homologous, complementary or otherwise functionally 
or structurally equivalent to these peptides may be used for purposes of the 
present invention. 

"Peptide mimetics" are structures which serve as substitutes for 
peptides in interactions between molecules (See Morgan et al (1989), Ann. 
30 Reports Med. Chem. 24:243-252 for a review). Peptide mimetics include 
synthetic structures which may or may not contain amino acids and/or peptide 
bonds but retain the structural and functional features of a peptide, or 



-30- 

enhancer or inhibitor of the invention. Peptide mimetics also include peptoids, 
oligopeptoids (Simon et al (1972) Proc. Natl. Acad. Sci USA 89:9367); and 
peptide libraries containing peptides of a designed length representing all 
possible sequences of amino acids corresponding to a peptide of the 
5 invention. 

Peptide mimetics may be designed based on information obtained by 
systematic replacement of L-amino acids by D-amino acids, replacement of 
side chains with groups having different electronic properties, and by 
systematic replacement of peptide bonds with amide bond replacements. 

10 Local conformational constraints can also be introduced to determine 
conformational requirements for activity of a candidate peptide mimetic. The 
mimetics may include isosteric amide bonds, or D-amino acids to stabilize or 
promote reverse turn conformations and to help stabilize the molecule. Cyclic 
amino acid analogues may be used to constrain amino acid residues to 

15 particular conformational states. The mimetics can also include mimics of 
inhibitor peptide secondary structures. These structures can model the 3- 
dimensional orientation of amino acid residues into the known secondary 
conformations of proteins. Peptoids may also be used which are oligomers of 
N-substituted amino acids and can be used as motifs for the generation of 

20 chemically diverse libraries of novel molecules. 

Peptides of the invention may also be used to identify lead compounds 
for drug development. The structure of the peptides described herein can be 
readily determined by a number of methods such as NMR and X-ray 
crystallography. A comparison of the structures of peptides similar in 

25 sequence, but differing in the biological activities they elicit in target molecules 
can provide information about the structure-activity relationship of the target. 
Information obtained from the examination of structure-activity relationships 
can be used to design either modified peptides, or other small molecules or 
lead compounds that can be tested for predicted properties as related to the 

30 target molecule. The activity of the lead compounds can be evaluated using 
assays similar to those described herein. 
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Information about structure-activity relationships may also be obtained 
from co-crystallization studies. In these studies, a peptide with a desired 
activity is crystallized in association with a target molecule, and the X-ray 
structure of the complex is determined. The structure can then be compared 
5 to the structure of the target molecule in its native state, and information from 
such a comparison may be used to design compounds expected to possess, 
(iii) Drug Screening Methods 

In accordance with one embodiment, the invention enables a method 
for screening candidate compounds for their ability to increase or decrease 

10 the activity of the mutated malin protein. The method comprises providing an 
assay system for assaying malin activity, assaying the activity in the presence 
or absence of the candidate or test compound and determining whether the 
compound has increased or decreased malin activity. Such compounds may 
be useful in treating Lafora's disease. 

15 Accordingly, the present invention provides a method for identifying a 

compound that affects mutated malin protein activity or expression 
comprising: 

(a) incubating a test compound with a malin protein or a nucleic acid 
encoding a malin protein; and 
20 (b) determining an amount of malin protein activity or expression and 

comparing with a control (i.e. in the absence of the test substance), wherein a 
change in the malin protein activity or expression as compared to the control 
indicates that the test compound has an effect on malin protein activity or 
expression. 

25 In accordance with a further embodiment, the invention enables a 

method for screening candidate compounds for their ability to increase or 
decrease expression of a malin protein. The method comprises putting a cell 
with a candidate compound, wherein the cell includes a regulatory region of a 
malin gene operably joined to a reporter gene coding region, and detecting a 

30 change in expression of the reporter gene. 

In one embodiment, the present invention enables culture systems in 
which cell lines which express the mutated malin gene are incubated with 



-32- 

candidate compounds to test their effects on mutated malin expression. Such 
culture systems can be used to identify compounds which upregulate or 
downregulate malin expression or its function, through the interaction with 
other proteins. 

5 Such compounds can be selected from protein compounds, chemicals 

and various drugs that are added to the culture medium. After a period of 
incubation in the presence of a selected test compound(s), the expression of 
mutated malin can be examined by quantifying the levels of malin mRNA 
using standard Northern blotting procedure, as described in the examples 

10 included herein, to determine any changes in expression as a result of the test 
compound. Cell lines transfected with constructs expressing malin can also 
be used to test the function of compounds developed to modify the protein 
expression. 
C. Therapeutic Uses 

15 As previously discussed, the EPM2B gene and malin of the invention is 

likely involved in Lafora's disease. Accordingly, the present invention 
provides a method of treating Lafora's disease comprising of administering to 
a cell or animal in need thereof, an effective amount of agent that modulates 
EPM2B/malin expression and/or activity. The present invention also provides 

20 a use of an agent that modulates EP/W26/malin expression and/or activity to 
treat Lafora's disease or to prepare a medicament to treat Lafora's disease. 

The term "agent that modulates EPM2B/malin expression and/or 
activity" means any substance that can alter the expression and/or activity of 
the mutated EPM2B/malin found in the patient to be consistent with the wild 

25 type EP/Lf2B/malin. Examples of agents which may be used to include 
administering: a nucleic acid molecule encoding wild type EPM2B; the wild 
type malin protein as well as fragments, analogs, derivatives or homologs 
thereof; antibodies; antisense nucleic acids; peptide mimetics; and 
substances isolated using the screening methods described herein that can 

30 correct the mutation to result in EPM2B/maIin levels and/or function consistent 
with a person without the disease. 
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The term "effective amount" as used herein means an amount 
effective, at dosages and for periods of time necessary to achieve the desired 
results. 

The term "animal" as used herein includes all members of the animal 
5 kingdom, including humans. 

In one embodiment, the invention provides a method of treating 
Lafora's disease by administering to a cell or animal an effective amount of an 
agent that modulates the expression or the biological activity of the mutated 
malin protein. The present invention also provides a use of an effective 
10 amount of an agent that modulates the expression or the biological activity of 
the mutated malin protein to treat Lafora's disease or to prepare a 
medicament to treat Lafora's disease. Substances that inhibit the activity of 
mutated malin include peptide mimetics, malin antagonists and certain 
antibodies to malin. Substances that inhibit the expression of the mutated 
15 EPM2B gene include antisense oligonucleotides to a mutated EPM2B nucleic 
acid sequence. 

In accordance with another embodiment, the present invention enables 
gene therapy as a potential therapeutic approach to Lafora's disease, in which 
normal copies of the EPM2B gene are introduced into patients to successfully 

20 code for normal malin protein in several different affected cell types. 

Retroviral vectors can be used for somatic cell gene therapy especially 
because of their high efficiency of infection and stable integration and 
expression. The targeted cells however must be able to divide and the 
expression of the levels of normal protein should be high. The full length 

25 normal EPM2B gene can be cloned into a retroviral vector and driven from its 
endogenous promoter or from the retroviral long terminal repeat or from a 
promoter specific for the target cell type of interest. Other viral vectors which 
can be used include adeno-associated virus, vaccinia virus, bovine papilloma 
virus, or a herpesvirus such as Epstein-Barr virus. Gene transfer could also 

30 be achieved using non-viral means requiring infection in vitro. This would 
include calcium phosphate, DEAE dextran, electroporation, cationic or anionic 
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lipid formulations (liposomes) and protoplast fusion. Although these methods 
are available, many of these are lower efficiency. 

Anti-sense based strategies can be employed to inhibit mutated 
EPM2B gene function and as a basis for therapeutic drug design. The 

5 principle is based on the hypothesis that sequence specific suppression of 
gene expression can be achieved by intracellular hybridization between 
mRNA and a complementary anti-sense species. It is possible to synthesize 
anti-sense strand nucleotides that bind the sense strand of RNA or DNA with 
a high degree of specificity. The formation of a hybrid RNA duplex may 

10 interfere with the processing/transport/translation and/or stability of a target 
mRNA. 

Hybridization is required for an antisense effect to occur. Antisense 
effects have been described using a variety of approaches including the use 
of antisense oligonucleotides, injection of antisense RNA/ DNA and 

15 transfection of antisense RNA expression vectors. 

Therapeutic antisense nucleotides can be made as oligonucleotides or 
expressed nucleotides. Oligonucleotides are short single strands of DNA 
which are usually 15 to 20 nucleic acid bases long. Expressed nucleotides 
are made by an expression vector such as an adenoviral, retroviral or plasmid 

20 vector. The vector is administered to the cells in culture, or to a patient, 
whose cells then make the antisense nucleotide. Expression vectors can be 
designed to produce antisense RNA, which can vary in length from a few 
dozen bases to several thousand. 

Antisense effects can be induced by control (sense) sequences. The 

25 extent of phenotypic changes are highly variable. Phenotypic effects induced 
by antisense are based on changes in criteria such as biological endpoints, 
protein levels, protein activation measurement and target mRNA levels. 
D. Pharmaceutical Compositions 

The above described substances including nucleic acids encoding 

30 EPM2B and mutated EPM2B, malin and mutated malin proteins, antibodies, 
and antisense oligonucleotides as well as other agents that modulate 
£PM28/malin and/or mutated EP/W2B/malin may be formulated into 
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pharmaceutical compositions for administration to subjects in a biologically 
compatible form suitable for administration in vivo. By "biologically compatible 
form suitable for administration in vivo" is meant a form of the substance to be 
administered in which any toxic effects are outweighed by the therapeutic 
5 effects. The substances may be administered to living organisms including 
humans, and animals. 

Administration of a therapeutically active amount of pharmaceutical 
compositions of the present invention is defined as an amount effective, at 
dosages and for periods of time necessary to achieve the desired result. For 

10 example, a therapeutically active amount of a substance may vary according 
to factors such as the disease state, age, sex, and weight of the individual, 
and the ability of the substance to elicit a desired response in the individual. 
Dosage regimes may be adjusted to provide the optimum therapeutic 
response. For example, several divided doses may be administered daily or 

15 the dose may be proportionally reduced as indicated by the exigencies of the 
therapeutic situation. 

An active substance may be administered in a convenient manner such 
as by injection (subcutaneous, intravenous, etc.), oral administration, 
inhalation, transdermal application, or rectal administration. Depending on the 

20 route of administration, the active substance may be coated in a material to 
protect the compound from the action of enzymes, acids and other natural 
conditions which may inactivate the compound. If the active substance is a 
nucleic acid encoding, for example, a modified EPM2B gene may be delivered 
using techniques known in the art. 

25 The compositions described herein can be prepared by per se known 

methods for the preparation of pharmaceutical^ acceptable compositions 
which can be administered to subjects, such that an effective quantity of the 
active substance is combined in a mixture with a pharmaceutical^ acceptable 
vehicle. Suitable vehicles are described, for example, in Remington's 

30 Pharmaceutical Sciences (Remington's Pharmaceutical Sciences, Mack 
Publishing Company, Easton, Pa., USA 1985) or Handbook of 
Pharmaceutical Additives (compiled by Michael and Irene Ash, Gower 
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Publishing Limited, Aldershot, England (1995)). On this basis, the 
compositions include, albeit not exclusively, solutions of the substances in 
association with one or more pharmaceutical^ acceptable vehicles or 
diluents, and may be contained in buffered solutions with a suitable pH and/or 
5 be iso-osmotic with physiological fluids. In this regard, reference can be 
made to U.S. Patent No. 5,843,456. As will also be appreciated by those 
skilled, administration of substances described herein may be by an inactive 
viral carrier. 

E. Experimental Models 

10 The present invention also includes methods and experimental models 

for studying the function of the EPM2B gene and malin protein. Cells, tissues 
and non-human animals that lack the EPM2B gene or partially lack in malin 
expression may be developed using recombinant expression vectors having a 
specific deletion or mutation in the EPM2B gene. A recombinant expression 

15 vector may be used to inactivate or alter the EPM2B gene by homologous 
recombination and thereby create an EPM2B deficient cell, tissue or animal. 

Null alleles may be generated in cells, such as embryonic stem cells by 
deletion mutation. A recombinant EPM2B gene may also be engineered to 
contain an insertion mutation which inactivates EPM2B. Such a construct 

20 may then be introduced into a cell, such as an embryonic stem cell, by a 
technique such as transfection, electroporation, injection etc. Cells lacking an 
intact EPM2B gene may then be identified, for example by Southern blotting, 
Northern Blotting or by assaying for EPM2B using the methods described 
herein. Such cells may then be fused to embryonic stem cells to generate 

25 transgenic non-human animals deficient in EPM2B. Germline transmission of 
the mutation may be achieved, for example, by aggregating the embryonic 
stem cells with early stage embryos, such as 8 cell embryos, in vitro: 
transferring the resulting blastocysts into recipient females and; generating 
germline transmission of the resulting aggregation chimeras. Such a mutant 

30 animal may be used to define specific cell populations, developmental 
patterns and in vivo processes, normally dependent on EPM2B expression. 
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The present invention also includes the preparation of tissue specific knock* 
outs of the EPM2B gene. 

The following non-limiting example is illustrative of the present 
invention: 
5 EXAMPLES 
Example 1 

The inventors examined all available databases and found 7 annotated 
genes in the newly defined critical region on chromosome 6p22.3 (Figure 1). 
Based on their predicted functional characteristics each gene was prioritized 

10 for mutation screening through DNA sequencing of LD patients carrying an 
EPM2B haplotype, but no pathogenic variants were identified. 
Simultaneously, the inventors' analysis led to the discovery of a previously 
uncharacterized apparently single-exon (1188 bp) gene sharing extensive 
sequence identity with orthologous units (with equivalent protein-coding 

15 potential) in other higher vertebrates (75%, 78%, 87% nucleotide identity with 
rat, mouse, and dog, respectively). The human gene, designated as EPM2B, 
also had at its 5'-end all of the proposed features of the consensus sequence 
of an eukaryotic translational initiation site and, at its 3'-end, two putative 
polyadenylation signals (Figure 1). Moreover, expressed sequence tag (EST) 

20 and cDNA data in human and mouse supported that the single-exon unit was 
a bona fide gene. 

EPM2B would be predicted to encode a 395 amino acid (aa) protein 
that the inventors have named malin (mal for seizure in French) containing a 
zinc finger of the RING type in the N-terminal half and 6 NHL-repeat domains 

25 in the C-terminal direction (Figure 1). Specifically, the zinc-binding RING- 
finger motif (C^-C-X^-C-XrH-Xa-C-X^C-Xu-C-Xz-C) was identified (E- 
value of 0.0067) between residues 26-71 of malin consistent with the 
signature sequence (C-X2-C-XM9-C-XL3-H-X2.3-H-X2-C-X448-C-X2-C) of the 
RING-HC type (16,17). The presence of a RING finger is predictive of an E3 

30 ubiquitin ligase function (8,9,18). E3 ligation is the final and specific step of 
the ubiquitin pathway transferring ubiquitin from E2, either directly or through 
adaptor proteins, to a specific substrate(s) to initiate its removal by the 



proteasome system8. The 6 NHL domains (10-13) were predicted on the 
basis of presence of an approximately 44-residue motif rich in glycine and 
hydrophobic amino acids seeded with a cluster of charged residues (Pfam 
detected six trusted matches for NHL domains with E-values ranging from 

5 0.011 to 3.5) (Figure 1). 

Northern-blot analysis indicated EPM2B is present (as at least two 
transcripts 1.5kb and 2.4kb in size) in all tissues examined including specific 
sub-regions of the brain (Figure 3). The observed transcript sizes correspond 
near to the lengths expected between the predicted ATG-start site and the 

10 two different polyadenylation signals (Figure 1). Moreover, the expression 
profile was similar to that observed for EPM2A, both being present in all 
tissues in which Lafora bodies have been observed (2,19). 

The complete coding region of EPM2B was sequenced in a cohort of 
34 LD probands previously shown not to carry mutations in EPM2A. To date, 

15 17 different DNA sequence alterations have been found in EPM2B in 26 
families including 8 deletions and 1 insertion leading to frame-shifts, 7 
missense, and 1 non-sense change (Table 1). These mutations were found in 
families in both homozygous (18) and compound heterozygous (8) recessive 
states. The four consanguineous F-C families used in the original linkage 

20 study all carried a homozygous 76T>A change producing a cysteine-to-serine 
alteration in one of the 7 conserved cysteine residues that are critical for the 
zinc-binding ability of the RING-finger domain (Figure 2b). The most common 
mutation identified (7 families) was a homozygous 205OG transition resulting 
in a proline to alanine change in the RING-finger domain. The LD38324 family 

25 that was critical in refining the EPM2B locus (Figure 2a) carried a 
homozygous 2-bp deletion (1 048-1 049delGA) leading to a frame-shift 
mutation in the fifth NHL-domain (Figure 2b). In total, 88% or the LD families 
can now be accounted for by mutations in EPM2A (48%) and EPM2B (40%). 
The observation of 8 families with no detectable mutations in either of these 

30 genes suggests there could be additional LD loci. 

Among other conditions with polyglucosan accumulation, including 
adult polyglucosan body disease (APBD) (20), LD is unique for the sub- 
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cellular location of inclusions in neuronal dendrites but not axons (21), as is 
show in Figure 4. The physical association of the forming polyglucosan fibrils 
with ER is also specific to LD (3). In APBD, which is caused by mutations in 
the glycogen branching enzyme (20,22,23), polyglucosans are 
5 indistinguishable in size, composition, and number from Lafora bodies, but 
they are located exclusively in the cell soma and axons (20). The presence of 
a seizure phenotype in LD but not in APBD implicates ER-associated dendritic 
accumulations of polyglucosans in the epilepsy of LD. 

To examine the cellular localization of malin the inventors transfected 

10 an epitope-tagged EPM2B construct and found that it did indeed localize at 
the ER and to a lesser extent within the nucleus of cultured cells (Figure 5). 
These results were similar to the cellular localization observed for the two 
alternative transcripts (A and B) of EPM2A, which encode isoforms of laforin 
found in the cytoplasm at the ER (24-26) (Figure 4b) and in the nucleus, 

15 respectively (27). The inventors most recent data implicate loss of function of 
the cytoplasmic form of laforin in LD based on the identification of transcript A- 
specific mutations (LI. et al., manuscript submitted). Moreover, the study of 
murine-Epm2a knockouts (28) and LD patients (1,2) has shown that the ER- 
associated polyglucosan bodies precede or are concomitant, respectively, 

20 with onset of epilepsy. 

Therefore, in the simplest explanation, LD arises due to improper 
clearance, and subsequent accumulation of polyglucosans in dendrites, 
disturbing neuronal synaptic function leading to epileptogenesis. The 
inventors have now shown in transgenic LD mice that laforin contacts 

25 polyglucosans (and not glycogen) providing the first physical link between a 
disease gene product and LD pathology (E.M.C. et al., in preparation). 
Laforin's only other experimentally-validated function is that of a dual- 
specificity phosphatase (24,25), which would predict that there is at least one 
phosphoprotein intermediary through which it acts. Possible candidates could 

30 be the newly discovered EPM2AIP1 laforin-interacting protein (26), or other 
still to be cloned LD gene(s), or any of their interacting proteins. Malin will 
likely be involved via specific protein-protein interaction through its NHL 
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domains followed by ubiquitin-mediated removal of a regulatory target(s), 
contributing a crucial role with laforin to safeguard neurons against Lafora 
bodies and epilepsy. 
METHODS 
5 Samples 

All patients described in this study were formally diagnosed with 
adolescent-onset progressive myoclonus epilepsy based on presence of 
pathognomonic Lafora bodies in biopsies of skin, skeletal muscle, liver or 
brain. Each LD individual and their family members (if available) were 

10 examined for involvement of the EPM2A locus at 6q24 by homozygosity 
mapping, mutation screening, or both. 
Genotyping and Mutation Screening 

Information and amplification conditions on the established (D6S274, 
D6S285, D6S966, D6S1567, D7S1678, D6S1688, D6S1959) and new 

15 (BV012563, BV012730, BV012565, BV012566, BV012568) are found in the 
UniSTS and Entrez Nucleotides database (http://www.ncbi.nlm.nih.gov/). DNA 
Sequence variations were detected by sequencing of PCR-products. To 
screen EPM2B, two sets of primer pairs that amplify overlapping fragments 
were used EPM2B-1 F: (5-ACTGTGACCGTG ACCGAGA-3') and EP/W2S-1R: 

20 (CACACCCCAAGGTAAGGAGA-3'); EPM2B- 2 F : ( 5 ' - 

GACTGCCATGTGGTTGTCAC-3') and EPM2B-2R: (5'- 
AAACAATTCATTAATGGCAGCA-3') (see Figure 1). PCR was performed on 
50 ng of DNA in buffer [75 mM Tris-HCI (pH 8.8), 20 mM (NH4)2S04, 0.01% 
Tween 20, 1.5 mM MgCI2, 1M Betaine, 0.2 mM dNTP, 0.2 uM of each primer, 

25 2.5 Units of Taq Polymerase (MBI Fermentas)]. Cycling conditions were: initial 
denaturation at 94°C for 3 min followed by 35 cycles of denaturation at 94°C 
for 30 sec, annealing at 60°C for 30 sec and extension at 72°C for 30 sec, 
with a 10 min final extension at 72°C. PCR products were purified using 
mircroCLEAN (Microzone Ltd). 3pl (100ng/|jl) of purified PCR product was 

30 used as sequencing template. For all reactions, 1pl (5 pmol) of primer, 1.5pl 
5X sequencing buffer (Applied Biosystems), 1 pi BigDye Terminator v3.1, and 
7.5pl H20 in a 14pl reaction volume were used. Thermocycling (MJ 
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Research, Inc.) conditions were denaturing at 96°C for 30s; annealing at 50° 
for 20s; and extension at 60°C for 4 min; 35 cycles. All reactions were 
subsequently purified using multiscreen-HV filter plates (Millipore) and 
analyzed using an ABI-3700. All sequence variants detected in LD patients 
5 were examined in a collection of 50 (100 chromosomes) randomly selected 
DNA samples. 

Gene identification and Northern blots 

Gene annotation data and EST sequences were mainly obtained from 
the University of California Santa Cruz (UCSC) genome browser 

10 (http://genome.ucsc.edu/) and Celera Genomics (http://www.celera.com/). 
Additional putative genes were annotated using the Genescript algorithm 
(http://tcag.bioinfo.sickkids.on.ca/genescript/) and multi-species VISTA 
(http://www-gsd.lbl.gov/vista/) alignments between human and mouse 
sequence. The RING finger domain was predicted by Pfam, Prosite, 

15 InterProScan, SMART and MotifScan. NHL domains were identified using 
Pfam and InterProScan. EPM2B orthologues were identified using BLASTN 
and BLASTP analyses against the GenBank non-redundant database. 
Prediction of the sub-cellular localization of malin by sequence analysis was 
performed using PSORT II. No significant signal peptide sequence (for 

20 recognition of ER f Golgi complex, lysosome and integral plasma membrane 
proteins), mitochondrial targeting sequence, nuclear localization signal and 
peroxisomal targeting signal was identified. The human multiple-tissue blot I 
and human brain blot II (Clontech) were probed with a { 32 PJdCTP-labeled 
probe that was generated using the primers 5'- 

25 GTCACCATCACCAACGACTG-3' and 5-TGCGAAAGACCATGAGTGAC-3\ 
which amplified a 557bp fragment within the coding region of EPM2B. 
Hybridization and washing conditions were performed according to the 
manufacturer's instructions. 

Sub-cellular localization and Electron Microscopy 

30 The myc-tagged EPM2A transcript A expression construct 

(pcDNA3mycEPM2A), which encodes the cytoplasmic isoform of laforin has 
been described (24). A myc-tagged EPM2B construct was generated 
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(pcDNA3mycEPM2B) using the same general protocols. Full-length EPM2B 
was amplified by PCR from genomic DNA using the (forward) primer (5'- 
ggatccATGgcggccgaagc-3') containing a BamH\ restriction site (underlined) 
and the start codon (uppercase) and a (reverse) primer (5'- 

5 gcggrogc acaattcattaatggcagac-3') containing a Notl site (underlined). This 
product was cloned into the corresponding sites of the mammalian expression 
vector pcDNA3 (Invitrogen). Myc was then introduced, in frame, after 
amplifying from a previous myc-containing vector with 5' Kpnl-tagged and 3' 
BaroHI-tagged primers. pcDNA3mycEPM2A and pcDNA3mycEP/W2fi (2 mg) 

10 were transfected into Cos-7 cells using Lipofectamine-Plus (Invitrogen) and 
exposed to lipid-DNA complex in DMEM (Sigma-Aldrich) for 5 hours. Forty- 
eight hours post-transfection, cultures were rinsed twice in PBS and fixed for 
15 min at -20°C in an acetone:methanol (1:1) mix. They were then stained 
with antibodies against myc-laforin and ER marker GRP94. Cultures were 

15 blocked for 1 hour (10% BSA/PBS) and incubated with anti-Myc and anti-ER 
for 45 min at room temperature. Slides were washed with PBS and incubated 
with secondary antibody (FITC-labeled goat anti-mouse, 1 :400, detectable 
through the green filter; Texas red-labeled donkey anti-goat, 1:400, detectable 
through the red filter; Jackson ImmunoResearch Laboratories) in blocking 

20 solution. Following mounting (Dako Anti-Fade), they were analyzed by 
immunofluorescence light microscopy. For electron microscopic examination 
biopsy material was obtained from the LD patient and placed into chilled 
Universal fixative. Using standard protocols it was then analyzed at the ultra- 
structural level. 

25 While the present invention has been described with reference to what 

are presently considered to be the preferred examples, it is to be understood 
that the invention is not limited to the disclosed examples. To the contrary, the 
invention is intended to cover various modifications and equivalent 
arrangements included within the spirit and scope of the appended claims. 

30 All publications, patents and patent applications are herein 

incorporated by reference in their entirety to the same extent as if each 
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individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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Table 1. EPM2B mutations identified in LD patients. 



Family 


Origin 


Nucleotide change 


Predicted LiTect 


LD6 


F-Canada 


Homozygous 76T>A 


C26S (missense RING finger) 


LD7 


F-Canada 


Homozygous 76T>A 


C26S (missense KING ringer) 


LD27 


F-Canada 


Homozygous 76T>A 


C26S (missense RING ringer) 


LD28 


F-Canada 


Homozygous 76T>A 


C26S (missense RING finger) 


LD23102 


Brazil 


Homozygous 205OG 


P69A (missense RING finger) 


LDlaflOO 


Brazil 


Homozygous 205OG 


P69A (missense RING finger) 


LD5922 


Italy 


Homozygous 205OG 


P69A (missense RING finger) 


LDlaf26 


Italy 


Heterozygous 205OG 
838G>A 


P69A (missense RING finger) 
E279K (missense NHL 4) 


LD4I818 


Spain 


Heterozygous 205OG 
468delA 


P69A (missense RING finger) 
G158fsl73 (frameshift) 


LD35180 


F-Canada 


Heterozygous 205OG 
204delC 


P69A (missense RING finger) 
P69fs21 (frameshift RING finger) 


LD29852 


United States 


Heterozygous 205OG 
468-469delAG 


P69 A (missense RING finger) 
G 1 58fsl6 (frameshift) 


LD34477 


India 


Heterozygous 468-469delAG 

OfOLr> I 


Gl58fsl6 (frameshift) 

n??/>V f nonsense NH1 X\ 


LD51 


Brazil 


Homozygous 468-469delAG 


G158fslo (frameshift) 


LDlaflOl 


Italy 


Homozygous 468-469delAG 


G158fsl6 (frameshift) 


LDIaf9 


Yugoslavia 


Heterozygous 992delG 
468-469delAG 


G321fs2 (frameshift NHL 5) 
G158fsl6 (frameshift) 


LDIafl 


Yugoslavia 


Heterozygous 992delG 
468-469delAG 


G321fs2 (frameshift NHL 5) 
G158fsl6 (frameshift) 


LD949 


Bosnia 


Homozygous 992delG 


G32lfs2 (frameshift NHL 5) 


LD38324 


Yugoslavia 


Homozygous 1048-1049delGA 


E340fs40 (frameshift NHL 5) 


LD7635 


Israel 


Homozygous 373-382dell0bp 


TI25fsl03 (frameshift) 


LD628 


Italy 


Homozygous 661-692del32bp 


V16fsl (frameshift NHL 3) 


LD483 


Italy 


Homozygous 260T>C 


L87P (missense) 


LD22830 


Canada 


Homozygous 905A>C 


Q29P (missense NHL 4) 


LD32817 


Pakistan 


Homozygous 98T>C 


F33S (missense RING finger) 


LD25-9 


Saudi Arabia 


Homozygous 892ins2T 


S298fsl5 (frameshift NHL 4) 


LD5487 


Denmark 


Heterozygous 436G>A 
HOOdelT 


D146N (missense NHL 1) 
V362fs20 (frameshift) 


LD5489 


Denmark 


Homozygous HOOdelT 


V362fs20 (frameshift) 



EPM2B mutations have been identified in 26 families. In family LD51, DNA from the proband 
was not available but the parents were both heterozygous carriers of 468-469delAG, which 
5 would be predicted to lead to homozygous frame-shift mutations in the LD child. Ail of the 
mutations detected would affect the putative RING or NHL motifs, or would be predicted to 
lead to a frame-shift or cause drastic structural change in the protein (LD483 carries a 
260T>C nucleotide change which would lead to a leucine to proline alteration). Four silent 
DNA sequence-coding variants were identified. Three of them T312C (H104H), G372C 
10 (G124G) and T1020C (G340G) were present in five, two, and one of 100 control 
chromosomes, respectively. The most common polymorphism detected, C332T (P111L) 
(Figure 1) was observed on 42 of 100 control chromosomes. 
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WE CLAIM : 

1. An isolated nucleic acid molecule encoding a protein with a RING- 
finger domain and 6 NHL-motifs wherein the protein is associated with 

5 Lafora's disease. 

2. A nucleic acid according to claim 1 having a sequence as shown in 
SEQ ID NO:1 (Figure 6A) or SEQ ID NO:3 (Figure 7). 

10 3. An isolated nucleic acid molecule according to claim 1 comprising 

(a) a nucleic acid sequence as shown in SEQ ID NO:1 (Figure 6A) 
and SEQ ID NO:3 (Figure 7A), wherein T can also be U; 

(b) a nucleic acid sequence complementary to (a); 

(c) a nucleic acid sequence that has substantial sequence 
15 homology to a nucleic acid sequence of (a) or (b); 

(d) a nucleic acid sequence that is an analog of a nucleic acid 
sequence of (a), (b) or (c); or 

(e) a nucleic acid sequence that hybridizes to a nucleic acid 
sequence of (a), (b), (c) or (d) under stringent hybridization conditions. 

20 

4. A method of detecting Lafora's disease comprising detecting a 
mutation or deletion in a nucleic acid sequence according to any one of claims 
1 to 3 in a sample from an animal. 

25 5. A method according to claim 4 comprising detecting a C to G change in 
nucelotide number 205 of the sequence shown in SEQ ID NO:1 (Figure 6). 

6. A method according to claim 4 comprising detecting a T -> A mutation 
at position 76 in the EPM2B gene (SEQ ID NO:1). 

30 



7. A method according to claim 4 comprising detecting a deletion of GA at 
positions 1048 and 1049 in the EPM2B gene (SEQ ID NO:1). 
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8. A method according to claim 4 comprising detecting a mutation in the 
EPM2B gene as indicated in Table 1. 

5 9. An isolated protein containing a RING-finger domain and six NHL 
domains which protein is associated with Lafora's disease. 

10. A protein according to claim 9 having the amino acid sequence as 
shown in SEQ.ID.NO.:2 (Figure 6B) or SEQ ID NO:4 (Figure 7). 

10 

11. A method for detecting Lafora's disease comprising detecting a 
mutation in a protein according to any one of claims 9 or 10. 



15 



12. A method according to claim 11 comprising detecting a mutation in the 
EPM2B gene as indicated in Table 1. 
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B&P File No. 9962-51 



ABSTRACT OF THE DISCLOSURE 

A novel gene (EPM2B) that is mutated in people with Lafora's disease 
is described. 
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6c en 



LD6 



D6S1959 
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2 3 2 


BV012S68 
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BV012S68 
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8 
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BV013S6S 
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8V012729 
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BVD1273Q 
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4 
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1 2 
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4 4 
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5 7 



6pter 
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76T>A 



um 



TCGAGAGCAAGG 



Control 




TCGAGTGCAAGG 



LD38J24 
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2 3 12 12 

3 2 13 13 
12 11 0 0 

4 1 2 4 2 4 
2 1 4 2 4 2 
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1048-1049<telG* 




ACCTGGAGT 



Control 

ACCTGAGGAGT 



heart 
brain 
placenta 

lung 
liver 
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kidney 
pancreas 

cerebellum 

cer.cortex 
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spinal cord 
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frontal lobe 
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Fig 6A 



EPM2B gene sequence 

EPM2B ORF is in uppercase 
Transcript : 2134bp 

gcacaggacgcgcc 

ATGGCGGCCGAAGCCTCGGAGAGCGGGCCAGCGCTGCATGAGCTCATGCG 

CGAGGCGGAGATCAGCCTGCTCGAGTGCAAGGTGTGCTTTGAGAAGTTTG 

GCCACCGGCAGCAGCGGCGCCCGCGCAACCTGTCCTGCGGCCACGTGGTC 

TGCCTGGCCTGCGTGGCCGCCCTGGCGCACCCGCGCACTCTGGCCCTCGA 

GTGCCCATTCTGCAGGCGAGCTTGCCGGGGCTGCGACACCAGCGACTGCC 

TGCCGGTGCTGCACCTCATAGAGCTCCTGGGCTCAGCGCTTCGCCAGTCC 

CCGGCCGCCCATCGCGCCGCCCCCAGCGCCCCCGGAGCCCTCACCTGCCA 

CCACACCTTCGGCGGCTGGGGGACCCTGGTCAACCCCACCGGACTGGCGC 

TTTGTCCCAAGACGGGGCGTGTCGTGGTGGTGCACGACGGCAGGAGGCGT 

GTCAAGATTTTTGACTCAGGGGGAGGATGCGCGCATCAGTTTGGAGAGAA 

GGGGGACGCTGCCCAAGACATTAGGTACCCTGTGGATGTCACCATCACCA 

ACGACTGCCATGTGGTTGTCACTGACGCCGGCGATCGCTCCATCAAAGTG 

TTTGATTTTTTTGGCCAGATCAAGCTTGTCATTGGAGGCCAATTCTCCCT 

ACCTTGGGGTGTGGAGACCACCCCTCAGAATGGGATTGTGGTAACTGATG 

CGGAGGCAGGGTCCCTGCACCTCCTGGACGTCGACTTCGCGGAAGGGGTC 

CTTCGGAGAACTGAAAGGTTGCAAGCTCATCTGTGCAATCCCCGAGGGGT 

GGCAGTGTCTTGGCTCACCGGGGCCATTGCGGTCCTGGAGCACCCCCTGG 

CCCTGGGGACTGGGGTTTGCAGCACCAGGGTGAAAGTGTTTAGCTCAAGT 

ATGCAGCTTGTCGGCCAAGTGGATACCTTTGGGCTGAGCCTCTACTTTCC 

CTCCAAAATAACTGCCTCCGCTGTGACCTTTGATCACCAGGGAAATGTGA 

TTGTTGCAGATACATCTGGTCCAGCTATCCTTTGCTTAGGAAAACCTGAG 

GAGTTTCCAGTACCGAAGCCCATGGTCACTCATGGTCTTTCGCATCCTGT 

GGCTCTTACCTTCACCAAGGAGAATTCTCTTCTTGTGCTGGACACAGCAT 

CTCATTCTATAAAAGTCTATAAAGTTGACTGGGGGTGAtgggctggggtg 

ggtccctggaatcagaagcactagtgctgccattaatgaatcgtttaacc 

ctggataagtcacttaaactcatctatccaggcagggataattaaaacca 

tctggcagacttacaaagcttgggacagttattggagattaatctaccat 

ttattgaatgcatactctgtgcaaggaaatttgcaaatattagcttattt 

aatctgtactatccagtgaggtaatttcttcccccccaagatagagtcaa 

gctctgtcacccaggctggagtgcagaagcacgatcacagctcactacag 

tttcaacgtcccccgctcaggtggtccttccacctcagcctcccaagtag 

ctgggaccacaagtgtgcattaccacactcagctaatttttgtattttgg 

cagagatggggtttcaccatgttgcccaggctggtctcaaactcctgagt 

tcaagcaatccaccttcctcggcctcccaaagtactaggagtacaggcat 

agccacttgctcagccataatttttattattaatctcattgtacaagtga 

gaaaactgagacccagagagcttaagtgacttcctcgaggtcatagttac 

ttactgccttagtcccaatttgaattcaattctgattccaaataagtcgc 

gcttaaataagacaacagatgtgggaaaaatatgtgaatgtgtagtgttg 

ctatgtgtactgtctttacaagtagctaattattttagcacaaagatgtg 

caaagaaaggagactttacggagagtccaggagaaaaaggattttgtggt 

ggccatcactttcattcaatttgcgactgctctgatggcacattagatga 

agttactgttgatcctgagttacgtgaataagaaaaacaattgaactgct 

tattaaaaaagtaaacatgt 
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EPM2B protein sequence 

MAAEASESGPALHELMREAEISLLECKVCFEKFGHRQQRRPRNLSCGHVV 

CLACVAALAHPRTLALECPFCRRACRGCDTSDCLPVLHLIELLGSALRQS 

PAAHRAAPSAPGALTCHHTFGGWGTLVNPTGLALCPKTGRVWVHDGRRR 

VKIFDSGGGCAHQFGEKGDAAQDIRYPVDVTITNDCHWVTDAGDRSIKV 

FDFFGQIKLVIGGQFSLPWGVETTPQNGIVVTDAEAGSLHLLDVDFAEGV 

LRRTERLQAHLCNPRGVAVSWLTGAIAVLEHPLALGTGVCSTRVKVFSSS 

MQLVGQVDTFGLSLYFPSKITASAVTFDHQGNVIVADTSGPAILCLGKPE 

EFPVPKPMVTHGLSHPVALTFTKENSLLVLDTASHSIKVYKVDWG 
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Promoter (5') sequence: 

1 OXXMG90C COCXXX3GOOC aaGGCMCE CCPB3CCCCC MQCAAOCCA 

51 A3QCC00003 GOCXXMGQC OOOCMCTIC CCEQOOOCaA GAACCAaGOC 

101 OX3CJQ3000C OOXXXXX^G O^COCBGCAC CMG00000G COXtXEOCC 

151 CAftSQKXXA GCOOCMCAC OCMOOCXXG O0CXBGO00C PCCCCCPCCA 

201 COCMCXXXX: GOCXXMCAC OCMOCXTM CAOOCAGO0C OCGCXXXMC 

251 (XX2GCOCCC GKXXXXXXX OCMCaaXA GOOOOftGOOC CBGCPGCBGC 

301 ACXXMCMG QGACIGCAAA GCGI3^33Cm COOCBGGIGG AACAO0GICT 

351 TcmnTnG cnraaasiT tgcmoctog gogatcqqqg gocaocqcic 

401 GAGOCIGTIT OOOCTCDCDG AAAG0G3W3C O3QOQ0G0QC CGOOOOOCDC 

451 cigock^m; aicAanrna: raaoccroaG oaaaaaaGPG aaGoaaxGA 

501 GOCTCXDCIC O03OQC3CCrC CGC&CTCSGC G0Q0G00CEC 0030033393 

551 AoacxMoa: GoaaoasMA aacroaaaac TGaaooasaG aoctcagsoc 

60i ooacxxrxxr cosacxxaoc: octgaocos aooaaooooc gsooocggoc 

651 O03G00Q0QG AOCEW303QC GU003COGGA GCGG0330QG 0330303 



Coding sequence: 
MG 

701 G3QG0O3^ CGGCGGOGftG O3QG0333CG CT303GGW3C TOCTGOGCXA 

751 (XjCCGKXJTC AGCTIGCTCG AGIGCAAGCT GlUCi'lOGAG MGITO9QOC 

801 A003CCAGCA GCGQCGCXXG OQCAAOcOGC OOGOSGaCA OSIGGIGIGC 

851 CIO3QCTG0S T33O3G00CT GC30QCACOOS 09GA03CIGG OOCIGGWGIG 

901 cxxrnciGC osoosggoct ocxxmrrc GG&caocAGC ®cigocigc 

951 OGCTQCTKA OCIOCT3GB3 CTOCIGQQCT GG303CIGOS aXBGOOOOC 

1001 GCCGOOOOOC QO3CCGCO0C 0030300000 O0CTO03CDC O3GGO30CCT 

1051 03CCTGCCAT OVOGOSITOG GMGCIGQGG GftOOCIGGIC AAO0OCM3G 

1101 G3CK3QCXXT GTGCCCCN>G A003QQCG3G lU^lUGTC&T GCAOS&03GC 

1151 MGAGGOGQG TCftWSadT TCACTOOGGG GSK3SOGOG OCCAICTGIT 

1201 T3GAC3*3Aft3 GGGSK3GCTG CXXMGAC7\T TfiGCTACCXX: CIGSKCT03 

1251 CXCTCACCAA OS^CIGOCAC GIGGTK3PCA OOSACGOCDG OSKXECIOC 

1301 MCAAMIGT TKMTICTT TGQ0C3CAIC AH3CTCCTCA TT3®CaOOV 

1351 GirnXXTIA OCTIGQG3CG TGGfiGAOCAC CXCTCBGttT G33CT0CT3G 

1401 TMCPGAOGC OGBG3CMQG TO3CIQCACC TOCTOSAAGT 030TJLUA 

1451 GBMGH300C TOCAfflGGAC TCRAAMCIG CftAGGICftlC TCIGCMOOC 

1501 GGGftGOGSIG QOOCTGIXXT QGCIQOGG QGOGMTOOG GTOCIGGBGC 

1551 AO0CT00GQG GCIG3G33CT G3GG09GGCA GCAOXXXCT CMGGIGITC 

1601 AGOOC3VACIA TOCAGdGAIT 0330CMGIG GATOULTI'IG GQCTCAGOCT 

1651 CITITTOCOC TCEAGAASAA. CXJQCCKX33C OalGMJLTIT <3HZ2CCPGG 

1701 GCaATOICTT TCTIGOGAT ALT1C I AGT C PQ3CQ3TCCT ATOCTIGQGA 

1751 raGOOIGBGS AATTTOCAGT OCIGAAG00C ATOfltaOOC MGGICITIC 

1801 omrO ClG T U GOOCACCT TCADCMGCA GAA3TCTCIT CTTOIGCIGG 

1851 /OKHGCBGC OOOTOOGBV AAACTCIACA MSCIG&CIG Q3GGIAA 



Fig 7A (cont'd) 



3' VTR: 

TOG 

1901 

1951 AATTAAIOOC 

2001 PZTNW32TTG 

2051 TKS^miG 

2101 Gicrocnac 

2151 TIBGAGAIGA 

2201 caronrAcr 

2251 AATOOCTIGC 

2301 GICTATATCT 

2351 TOG7ITOGA03 

2401 PCXXSIPGTGT 

2451 TIGWGICBC 

2501 AAAG3EAAAC 

2551 A3GAAKmCT 

2601 AAACTACTOC 

2651 GGAAGAOVGrT 

2701 OGfTTOGITCA 

2751 AOGCEaAAOC 

2801 CCOVTIGCAC 

2851 GOVmBGGC 

2901 CTITGACOCA 

2951 TAAGO0Q3AA 



AicrcroaA cooQcs^rcAr 

TOOT00CIGT GCTAGGAAAC 

GAAAACTGSAG ACXX3GMGCT 
TACIGTCACA GTCACAAIOS 
TCCTMGrraG GAE^ACAGAT 
TGTBOSK3 TCTACTCTCT 
TOCAiSGAAA GS^CnTC 
GAOSIGAGIG 
AGTMCTWa MGACTCIOG 

AGITOCTIGC TOCPGAAAGG 
03GCAGAATC ACTITO0CIG 

ACAcronrr ocotmoca 

CAClUb'IAAA. 0CX3CIGDVIA 

GaxAMraacr gctitaatac 

MTICW3SIG AGGTQ3Q03G 



EFCCPCzrnA accctogaig 

TATAACIX30C TGAQGACTT 
A3AATCA#jT CTACCGirrA 
TTK3CAAAIA TEAGCICMC 
TOOOCATCftT TAMtXXMT 
TIMGIGATT CTCTCAAGCT 

GAAcrcraoT CKs\crococ 

GIGfiGAAAAC GOKXaTCT 
CEATTICTCr 
TOCTGAGITC AAfflGGAAAA 

TOOTX5\Acr TCcrosrrcr 

AIOCTrGAAC TOOOCITOGG 
GIOCIXXMC AIMGAACaC 
CTTOICT9GA CAmAGITCA 

acctocbgig GGMS^rascr 

AIAGAACAAG AN3C030CAC 
OCTCITCKEA AICTCB3CN 
TOCT3CCAT TINAA3TICT 
GCTCOCITCA TCCINAGCEA 
CTTCTIGAGr TCAAGTIMG 
NIOT^TCGA NIO3A003AA 
GITCTTDflN AACCGoTITT 



Fig7B 



Met Gly Ala Glu Ala Ala Gly Ser Gly Arg Ala Leu Arg Glu Leu Val 
1 5 10 15 



Arg Glu Ala Glu Val Ser Leu Leu Glu Cys Lys Val Cys Phe Glu Arg 
20 25 30 



Phe Gly His Arg Gin Gin Arg Arg Pro Arg Asn Leu Pro Cys Gly His 
35 40 45 



Val Val Cys Leu Ala Cys Val Ala Ala Leu Ala His Pro Arg Thr Leu 
50 55 60 



Ala Leu Glu Cys Pro Phe Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr 
65 70 75 80 



Ser Asp Cys Leu Pro Val Leu His Leu Leu Glu Leu Leu Gly Ser Ala 
85 90 95 



Leu Arg Pro Ala Pro Ala Ala Pro Arg Ala Ala Pro Arg Ala Ala Pro 
100 105 110 



Cys Ala Pro Gly Ala Leu Ala Cys His His Ala Phe Gly Gly Trp Gly 
115 120 125 



Thr Leu Val Asn Pro Thr Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg 
130 135 140 



Val Val Val Val His Asp Gly Arg Arg Arg Val Lys lie Phe Asp Ser 
145 150 155 160 



Gly Gly Gly Cys Ala His Gin Phe Gly Glu Lys Gly Glu Ala Ala Gin 
165 170 175 



Asp lie Arg Tyr Pro Leu Asp Val Ala Val Thr Asn Asp Cys His Val 
180 185 190 



Val Val Thr Asp Ala Gly Asp Arg Ser lie Lys Val Phe Asp Phe Phe 
195 200 205 



Gly Gin He Lys Leu Val He Gly Asp Gin Phe Ser Leu Pro Trp Gly 
210 215 220 



Fig 7B (cont'd) 



Val Glu Thr Thr Pro Gin Asn Gly Val Val Val Thr Asp Ala Glu Ala 
225 230 235 240 

Gly Ser Leu His Leu Leu Glu Val Asp Phe Ala Glu Gly Ala Leu Gin 
245 250 255 

Aro Thr Glu Lys Leu Gin Gly His Leu Cys Asn Pro Arg Gly Val Ala 
260 265 270 

Val Ser Trp Leu Thr Gly Ala lie Ala Val Leu Glu His Pro Pro Gly 
275 280 285 

Leu Gly Ala Gly Ala Gly Ser Thr Ala Val Lys Val Phe Ser Pro Thr 
290 295 300 

Met Gin Leu lie Gly Gin Val Asp Thr Phe Gly Leu Ser Leu Phe Phe 
305 310 315 3ZQ 

Pro Ser Arg lie Thr Ala Ser Ala Val Thr Phe Asp His Gin Gly Asn 
325 330 335 

Val He Val Ala Asp Thr Ser Ser Gin Ala Val Leu Cys Leu Gly Gin 
340 345 350 

Pro Glu Glu Phe Pro Val Leu Lys Pro He He Thr His Gly Leu Ser 
355 360 365 

His Pro val Ala Leu Thr Phe Thr Lys Glu Asn Ser Leu Leu Val Leu 
370 375 380 

Asp Ser Ala Ala His Ser Val Lys Val Tyr Lys Ala Asp Trp Gly 
385 390 395 
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SEQUENCE LISTING 

<110> Scherer, Stephen W. 

Minassian, Berge A. 

<120> Novel Lafora's Disease Gene 

<130> 9962-51 

<160> 4 

<170> Patentln version 3.1 

<210> 1 

<211> 2134 

<212> DNA 

<213> Homo sapiens 



<400> 1 



gcacaggacg 


cgccatggcg 


gccgaagcct 


cggagagcgg 


gccagcgctg 


catgagctca 


60 


tgcgcgaggc 


ggagatcagc 


ctgctcgagt 


gcaaggtgtg 


ctttgagaag 


tttggccacc 


120 


ggcagcagcg 


gcgcccgcgc 


aacctgtcct 


gcggccacgt 


ggtctgcctg 


gcctgcgtgg 


180 


ccgccctggc 


gcacccgcgc 


actctggccc 


tcgagtgccc 


attctgcagg 


cgagcttgcc 


240 


ggggctgcga 


caccagcgac 


tgcctgccgg 


tgctgcacct 


catagagctc 


ctgggctcag 


300 


cgcttcgcca 


gtccccggcc 


gcccatcgcg 


ccgcccccag 


cgcccccgga 


gccctcacct 


360 


gccaccacac 


cttcggcggc 


tgggggaccc 


tggtcaaccc 


caccggactg 


gcgctttgtc 


420 


ccaagacggg 


gcgtgtcgtg 


gtggtgcacg 


acggcaggag 


gcgtgtcaag 


atttttgact 


480 


cagggggagg 


atgcgcgcat 


cagtttggag 


agaaggggga 


cgctgcccaa 


gacattaggt 


540 


accctgtgga 


tgtcaccatc 


accaacgact 


gccatgtggt 


tgtcactgac 


gccggcgatc 


600 


gctccatcaa 


agtgtttgat 


ttttttggcc 


agatcaagct 


tgtcattgga 


ggccaattct 


660 


ccttaccttg 


gggtgtggag 


accacccctc 


agaatgggat 


tgtggtaact 


gatgcggagg 


720 


cagggtccct 


gcacctcctg 


gacgtcgact 


tcgcggaagg 


ggtccttcgg 


agaactgaaa 


780 


ggttgcaagc 


tcatctgtgc 


aatccccgag 


gggtggcagt 


gtcttggctc 


accggggcca 


840 
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ttgcggtcct ggagcacccc 


ctggccctgg 


ggactggggt ttgcagcacc agggtgaaag 


900 


tgtttagctc 


aagtatgcag 


cttgtcggcc 


aa gtggatac ctttgggctg agcctctact 


960 


ttccctccaa 


aataactgcc 


tccgctgtga 


cctttgatca ccagggaaat gtgattgttg 


1020 


cagatacatc 


tggtccagct 


atcctttgct 


taggaaaacc tgaggagttt ccagtaccga 


1080 


agcccatggt 


cactcatggt 


ctttcgcatc 


ctgtggctct taccttcacc aaggagaatt 


1140 


ctcttcttgt gctggacaca 


gcatctcatt 


ctataaaagt ctataaagtt gactgggggt 


1200 


gatgggctgg ggtgggtccc 


tggaatcaga 


agcactagtg ctgccattaa tgaattgttt 


1260 


aaccctggat 


aagtcactta 


aactcatcta 


tccaggcagg gataattaaa accatctggc 


1320 


agacttacaa agcttgggac 


agttattgga 


gattaatcta ccatttattg aatgcatact 


1380 


ctgtgcaagg aaatttgcaa 


atattagctt 


atttaatctg tactatccag tgaggtaatt 


1440 


tcttcccccc 


caagatagag 


tcaagctctg 


tcacccaggc tggagtgcag aagcatgatc 


1500 


acagctcact 


acagtttcaa 


cgtcccccgc 


tcaggtggtc cttccacctc agcctcccaa 


1560 


gtagctggga 


ccacaagtgt 


gcattaccac 


actcagctaa tttttgtatt ttggcagaga 


1620 


tggggtttca 


ccatgttgcc 


caggctggtc 


tcaaactcct gagttcaagc aatccacctt 


1680 


cctcggcctc 


ccaaagtact 


aggagtacag 


gcatagccac ttgctcagcc ataattttta 


1740 


ttattaatct 


cattgtacaa 


gtgagaaaac 


tgagacccag agagcttaag tgacttcctc 


1800 


gaggtcatag 


ttacttactg 


ccttagtccc 


aatttgaatt caattctgat tccaaataag 


1860 


ttgcgcttaa 


ataagacaac 


agatgtggga 


aaaatatgtg aatgtgtagt gttgctatgt 


1920 


gtactgtctt 


tacaagtagc 


taattatttt 


agcacaaaga tgtgcaaaga aaggagactt 


1980 


tatggagagt 


tcaggagaaa 


aaggattttg 


tggtggccat cactttcatt caatttgcga 


2040 


ctgctctgat 


ggcacattag 


atgaagttac 


tgttgatcct gagttacgtg aataagaaaa 


2100 


acaattgaac 


tgcttattaa 


aaaagtaaac 


atgt 


2134 



<210> 2 

<211> 395 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Ala Ala Glu Ala Ser Glu Ser 
1 5 



Gly Pro Ala Leu His Glu Leu Met 
10 15 
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Arg Glu Ala Glu He Ser Leu Leu Glu Cys Lys Val Cys Phe Glu Lys 
20 25 ~ 30 



Phe Gly His Arg Gin Gin Arg Arg Pro Arg Asn Leu Ser Cys Gly His 
35 40 45 



Val Val Cys Leu Ala Cys Val Ala Ala Leu Ala His Pro Arg Thr Leu 
50 55 60 



Ala Leu Glu Cys Pro Phe Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr 
65 70 75 80 



Ser Asp Cys Leu Pro Val Leu His Leu lie Glu Leu Leu Gly Ser Ala 
85 90 95 



Leu Arg Gin Ser Pro Ala Ala His Arg Ala Ala Pro Ser Ala Pro Gly 
100 105 110 



Ala Leu Thr Cys His His Thr Phe Gly Gly Trp Gly Thr Leu Val Asn 
115 120 125 



Pro Thr Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg Val Val Val Val 
130 135 140 



His Asp Gly Arg Arg Arg Val Lys He Phe Asp Ser Gly Gly Gly Cys 
145 150 155 160 



Ala His Gin Phe Gly Glu Lys Gly Asp Ala Ala Gin Asp He Arg Tyr 
165 170 175 



Pro Val Asp Val Thr He Thr Asn Asp Cys His Val Val Val Thr Asp 
180 185 190 



Ala Gly Asp Arg Ser lie Lys Val Phe Asp Phe Phe Gly Gin He Lys 
195 200 205 



Leu Val He Gly Gly Gin Phe Ser Leu Pro Trp Gly Val Glu Thr Thr 
210 215 220 



Pro Gin Asn Gly He Val Val Thr Asp Ala Glu Ala Gly Ser Leu His 
225 230 235 240 



Leu Leu Asp Val Asp Phe Ala Glu Gly Val Leu Arg Arg Thr Glu Arg 
245 250 255 



51 



Leu Gin Ala His Leu Cys Asn Pro Arg Gly Val Ala val Ser Tip Leu 
260 265 270 



Thr Gly Ala He Ala Val Leu Glu His Pro Leu Ala Leu Gly Thr Gly 
275 280 285 



Val Cys Ser Thr Arg Val Lys Val Phe Ser Ser Ser Met Gin Leu Val 
290 295 300 



Gly Gin Val Asp Thr Phe Gly Leu Ser Leu Tyr Phe Pro Ser Lys He 
305 310 315 320 



Thr Ala Ser Ala Val Thr Phe Asp His Gin Gly Asn Val He Val Ala 
325 330 335 



Asp Thr Ser Gly Pro Ala He Leu Cys Leu Gly Lys Pro Glu Glu Phe 
340 345 350 



Pro Val Pro Lys Pro Met Val Thr His Gly Leu Ser His Pro Val Ala 
355 360 365 



Leu Thr Phe Thr Lys Glu Asn Ser Leu Leu Val Leu Asp Thr Ala Ser 
370 375 380 



His Ser He Lys Val Tyr Lys Val Asp Trp Gly 
385 390 395 



<210> 3 

<211> 3008 

<212> DNA 

<213> Canis sp. 

<220> 

<221> CDS 

<222> (698) . . (1897) 

<223* 



<220> 



<221> 
<222> 
<223> 

<220* 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 



misc_feature 
(2692) . . (2692) 
N=any nucleic acid 



mis cofeature 
(2748) . . (2748) 
N=any nucleic acid 



misc_feature 
(2750) . . (2750) 
N=any nucleic acid 



misc_feature 
(2793) . . (2793) 
N=any nucleic acid 



misc_feature 
(2845) . . (2845) 
N=any nucleic acid 



misc_feature 
(2916) . . (2916) 
N=any nucleic acid 
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<221> misc_feature 

<222> (2918) . . (2918) 

<223> N=any nucleic acid 

<220> 

< 2 2 1 > mi sc_f e a t ur e 

<222> (2931) • . (2931) 

<223> N=any nucleic acid 

<220> 

<221> misc_feature 

<222> (2941) . . (2941) 

<223> N«=any nucleic acid 

<220> 

<221> misc_feature 

<222> (2990) . . (2990) 

<223> N=any nucleic acid 



<400> 3 



ccccaaggcc 


cccccggccc ccaggcaacc 


ccaggccccc 


aggcaaccca 


aggccccccg 


60 


gccccaagcc 


ccccaggttc ccggccccaa 


gaaccaagcc 


ccccggcccc 


ccgcccccag 


120 


cacccagcac 


caagcccccg ccccccgccc 


caagcaccca 


gccccagcac 


ccagcccccg 


180 


ccccagcccc 


agccccagca cccagccccc 


gccccagcac 


ccagccccag 


cacccagccc 


240 


ccgccccagc 


cccagccccc gtcccccccc 


ccagcaccca 


gccccagccc 


cagcagcagc 


300 


acccagcagg 


ggactgcaaa gcgtaggcta 


ccccaggtgg 


aacaccgtgt 


tctagttttg 


360 


ctttgccgtt 


tgcagcctgg gcgatcgggg gccaccgctc 


gagcctgttt 


cccgtcgcgg 


420 


aaagcggagc 


cgccccgccc cgccccccgc 


ctgcctgaag 


gtcacgggcc 


tgggcctgcg 


480 


gcgcgcggtg 


cggcccgcga gcgtccgctc 


ccgcgccctc 


cgcagtcagc 


gcccgcccgc 


540 


ccgccggggg 


accgcaggcc gcggccgaga 


ggctgcgcgc 


tgcgcccgcg 


acgtcaggcc 


600 


ccgccccgcc 


ccgccccgcc ccgtgaccgg 


ccccggcccc 


ggccccggcc 


ccggccccgg 


660 



54 - 



accgagcggc gcccgcggga gcggcggcgg ccgcgcg atg ggg gcc gaa gcg gcg 715 

Met Gly Ala Glu Ala Ala 
1 5 

ggg age ggg egg gcg ctg egg gag ctg gtg cgc gag gcc gag gtc age 763 
Gly Ser Gly Arg Ala Leu Arg Glu Leu Val Arg Glu Ala Glu Val Ser 
10 15 20 

ttg etc gag tgc aag gtg tgc ttc gag agg ttc ggc cac cgc cag cag 811 
Leu Leu Glu Cys Lys Val Cys Phe Glu Arg Phe Gly His Arg Gin Gin 
25 30 35 

egg cgc ccg cgc aac ctg ccc tgc ggc cac gtg gtg tgc ctg gcc tgc 859 
Arg Arg Pro Arg Asn Leu Pro Cys Gly His Val Val Cys Leu Ala Cys 
40 45 50 

gtg gcg gcc ctg gcg cac ccg egg acg ctg gcc ctg gag tgc ccc ttc 907 
Val Ala Ala Leu Ala His Pro Arg Thr Leu Ala Leu Glu Cys Pro Phe 
55 60 65 70 

tgc cgc egg gcc tgc cgc ggc tgc gac acc age gac tgc ctg ccg gtg 955 
Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr Ser Asp Cys Leu Pro Val 
75 80 85 

ctt cac etc ctg gag etc ctg ggc teg gcg ctg cgc cca gcc ccc gcc 1003 
Leu His Leu Leu Glu Leu Leu Gly Ser Ala Leu Arg Pro Ala Pro Ala 
90 95 100 

gcc ccc cgc gcc gcc ccc cgc gcc gcc ccc tgc gcc ccg ggc gcc etc 1051 
Ala Pro Arg Ala Ala Pro Arg Ala Ala Pro Cys Ala Pro Gly Ala Leu 
105 110 115 

gcc tgc cat cac gcg ttc gga ggc tgg ggg acc ctg gtc aac ccc acg 1099 
Ala Cys His His Ala Phe Gly Gly Trp Gly Thr Leu Val Asn Pro Thr 
120 125 130 

ggg ctg gcg ctg tgc ccc aag acc ggg egg gtc gtg gtg gtg cac gac 1147 
Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg Val Val Val Val His Asp 
135 140 145 150 

99C agg agg egg gtc aag ate ttt gac tec ggg gga gga tgc gcc cat 1195 
Gly Arg Arg Arg Val Lys He Phe Asp Ser Gly Gly Gly Cys Ala His 
155 160 165 

cag ttt gga gag aag ggg gag get gcc cag gac att agg tac ccc ctg 124 3 
Gin Phe Gly Glu Lys Gly Glu Ala Ala Gin Asp He Arg Tyr Pro Leu 
170 175 180 

gac gtc gcc gtc acc aac gac tgc cac gtg gtt gtc acc gac gcc ggc 1291 
Asp Val Ala Val Thr Asn Asp Cys His Val Val Val Thr Asp Ala Gly 
185 190 195 

gac cgc tec ate aaa gtg ttt gat ttc ttt ggc cag ate aag etc gtc 1339 
Asp Arg Ser He Lys Val Phe Asp Phe Phe Gly Gin He Lys Leu Val 
200 205 210 

att gga gac cag ttt tec tta cct tgg ggc gtg gag acc acc cct cag 1387 
He Gly Asp Gin Phe Ser Leu Pro Trp Gly Val Glu Thr Thr Pro Gin 
215 220 225 230 



55 - 



aat ggg gtc gtg gta act gac gcc gag gca ggg teg ctg cac ctg ctg 1435 
Asn Gly Val Val Val Thr Asp Ala Glu Ala Gly Ser Leu His Leu Leu 
235 240 245 

gaa gtc gac ttt gca gaa gga gcc etc cag agg act gaa aag ctg caa 1483 
Glu Val Asp Phe Ala Glu Gly Ala Leu Gin Arg Thr Glu Lys Leu Gin 
250 255 " 260 

ggt cat ctg tgc aac ccg cga ggg gtg gcc gtg tec tgg etc act ggg 1531 
Gly His Leu Cys Asn Pro Arg Gly Val Ala Val Ser Trp Leu Thr Gly 
265 270 275 

gcc att gcg gtc ctg gag cac cct ccg ggg ctg ggg get ggg gcg ggc 1579 
Ala lie Ala Val Leu Glu His Pro Pro Gly Leu Gly Ala Gly Ala Gly 
280 285 290 

age acc gcc gtg aag gtg ttc age cca act atg cag ctg ate ggc cag 1627 
Ser Thr Ala Val Lys Val Phe Ser Pro Thr Met Gin Leu lie Gly Gin 
295 300 305 310 

gtg gat acc ttt ggg etc age etc ttt ttc ccc tct aga ata acc gcc 1675 
Val Asp Thr Phe Gly Leu Ser Leu Phe Phe Pro Ser Arg lie Thr Ala 
315 320 325 

tec gcc gtg acc ttt gat cac cag ggg aat gtg att gtt gca gat act 1723 
Ser Ala Val Thr Phe Asp His Gin Gly Asn Val lie Val Ala Asp Thr 
330 335 340 

tct agt cag gcc gtc eta tgc ttg gga cag cct gag gaa ttt cca gtc 1771 
Ser Ser Gin Ala Val Leu Cys Leu Gly Gin Pro Glu Glu Phe Pro Val 
345 350 355 

ctg aag ccc ate ate acc cat ggt ctt tec cat cct gtg gca ctg acc 1819 
Leu Lys Pro lie He Thr His Gly Leu Ser His Pro Val Ala Leu Thr 
360 365 370 

ttc acc aag gag aat tct ctt ctt gtg ctg gac agt gca gcc cat tec 1867 
Phe Thr Lys Glu Asn Ser Leu Leu Val Leu Asp Ser Ala Ala His Ser 
375 380 385 390 

gta aaa gtc tac aag get gac tgg ggg taa tggggtgtgg tgggggtcct 1917 
Val Lys val Tyr Lys Ala Asp Trp Gly 
395 

ggaactgeca ctaatccagt ttaaccctgg atgaattaat cccatctctc gaaeggggat 1977 

cattataaet gectgacaga cttataaagg ttgaaggtaa ttattaaaga ataataatga 2037 

agtctaccgt ttattgagtt atgtgctccc tgtgctagga aactttgeaa atattagctc 2097 

agcgtgtcct tacagtggta cccagggagg taatgeccat cattaatccc attttagaga 2157 

tgagaaaact gagacccgag ggtttaagtg attctctgaa ggtcatgttt acttactgtg 2217 

acagtcacaa tgggaactct attctgactc cccaatccct tgctcctaag taggataaca 2277 

gatgtgagaa aacgacagca tgtgtctata tgttgttact gtgtgtactc tctttacagg 2337 

tagctatttc tcttggttgg aegtgeagag aaaggagact ttctagagag ttcaagagga 2397 
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aaaagggtag tgtgatgagc atggacgtga gtgtcattga acttgctggt tctttgatgt 2457 

cacagtaggt agaatgactg tggatccttc aactgccctt gggaaaggta aacatgtctg 2517 

ttgggacctg gatgtcctcc atcataggaa cccaggaaat actagttggt tgctgcagaa 2577 

aggcttgtgt ggacataagt tcaaaactac tgccgaccac cgtacattca cacacctcca 2637 

gtgggagatg gctggaagac agtcctgtga caggtctgca ttcatagaac aagangccgc 2697 

caccgttggt tcacggcaga atgagtttgc ctgcctcttc ataatctgtg ncnacccgaa 2757 

acccttttgt gatagagttt ttctctgtgc catttnaatt tgtcccattg cacacactgt 2817 

tttcccctaa ccagctccct tgatgctnag ctagcattta ggccactggt aaacccctgt 2877 

atacttcttg agttgaagtt aagctttgac ccagataang nctgctttaa tacntgcagt 293 7 

cgantggacc gaataagggg gaaatttcag gtgaggtggc cgggttcttt atnaaccggt 2997 

tttggtttgt a 3008 

<210> 4 
<211> 399 
<212> PRT 
<213> Canis sp. 



<220> 

<22l> misc_feature 

<222> (2692) . . (2692) 

<223> N=any nucleic acid 
<220> 

<221> misc_f eature 

<222> (2748) . . (2748) 

<22 3> N=any nucleic acid 
<220> 

<221> misc_feature 

<222> (2750) . . (2750) 

<223> N=any nucleic acid 
<220> 
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<221> misc_feature 

<222> (2793) . . (2793) 

<223> N=any nucleic acid 
<220> 

<221> misc_feature 

<222> (2845) . . (2845) 

<223> N=any nucleic acid 
<220> 

<221> misc_feature 

<222> (2916) . . (2916) 

<223> N=any nucleic acid 
<220> 

<221> misc_feature 

<222> (2918) . . (2918) 

<223> N=any nucleic acid 
<220> 

<221> misc_feature 

<222> (2931) . . (2931) 

<223> N=any nucleic acid 
<220> 

< 2 2 1 > mi sc_f e a t ur e 

<222> (2941) . . (2941) 

<22 3> N=any nucleic acid 
<220> 

<221> raisc_feature 

<222> (2990) . . (2990) 

<22 3> N=any nucleic acid 

<400> 4 

Met Gly Ala Glu Ala Ala Gly Ser Gly Arg Ala Leu Arg Glu Leu Val 
15 10 15 
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Arg Glu Ala Glu Val Ser Leu Leu Glu Cys Lys Val Cys Phe Glu Arg 
20 25 30 



Phe Gly His Arg Gin Gin Arg Arg Pro Arg Asn Leu Pro Cys Gly His 
35 40 45 



Val Val Cys Leu Ala Cys Val Ala Ala Leu Ala His Pro Arg Thr Leu 
50 55 60 



Ala Leu Glu Cys Pro Phe Cys Arg Arg Ala Cys Arg Gly Cys Asp Thr 
65 70 75 80 



Ser Asp Cys Leu Pro Val Leu His Leu Leu Glu Leu Leu Gly Ser Ala 
85 90 95 



Leu Arg Pro Ala Pro Ala Ala Pro Arg Ala Ala Pro Arg Ala Ala Pro 
100 105 no 



Cys Ala Pro Gly Ala Leu Ala Cys His His Ala Phe Gly Gly Trp Gly 
115 120 125 



Thr Leu Val Asn Pro Thr Gly Leu Ala Leu Cys Pro Lys Thr Gly Arg 
130 135 140 



Val Val Val Val His Asp Gly Arg Arg Arg Val Lys He Phe Asp Ser 
i45 150 155 160 

Gly Gly Gly Cys Ala His Gin Phe Gly Glu Lys Gly Glu Ala Ala Gin 
165 170 175 

Asp He Arg Tyr Pro Leu Asp Val Ala Val Thr Asn Asp Cys His Val 
180 185 190 

Val Val Thr Asp Ala Gly Asp Arg Ser He Lys Val Phe Asp Phe Phe 
195 200 205 

Gly Gin He Lys Leu Val He Gly Asp Gin Phe Ser Leu Pro Trp Gly 
210 215 220 



Val Glu Thr Thr Pro Gin Asn Gly Val Val Val Thr Asp Ala Glu Ala 
225 230 235 240 



Gly Ser Leu His Leu Leu Glu Val Asp Phe Ala Glu Gly Ala Leu Gin 
245 250 255 
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Arg Thr Glu Lys Leu Gin Gly His Leu Cys Asn Pro Arg Gly Val Ala 
260 265 270 

Val Ser Trp Leu Thr Gly Ala He Ala Val Leu Glu His Pro Pro Gly 
275 280 285 



Leu Gly Ala Gly Ala Gly Ser Thr Ala Val Lys Val Phe Ser Pro Thr 
290 295 300 



Met Gin Leu He Gly Gin Val Asp Thr Phe Gly Leu Ser Leu Phe Phe 
305 310 315 320 

Pro Ser Arg He Thr Ala Ser Ala Val Thr Phe Asp His Gin Gly Asn 
325 330 335 

Val He Val Ala Asp Thr Ser Ser Gin Ala Val Leu Cys Leu Gly Gin 
340 345 350 

Pro Glu Glu Phe Pro Val Leu Lys Pro He He Thr His Gly Leu Ser 
355 360 365 



His Pro Val Ala Leu Thr Phe Thr Lys Glu Asn Ser Leu Leu Val Leu 
370 375 380 



Asp Ser Ala Ala His Ser Val Lys Val Tyr Lys Ala Asp Trp Gly 
385 390 395 
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